PERSONALITY STRUCTURE IN FOUR AND FIVE YEAR OLDS IN
TERMS OF OBJECTIVE TESTS
RAYMOND B. CATTELL AND DONALD R. PETERSON 


University of Illinois 


PROBLEM 

The importance for clinical psychology of the first few years of life has been 
fully conceded, and endlessly discussed, but until measurement of temperamental 
tendencies and dynamic structural growth in this period can be achieved, little pro- 
gress can be made in exactly checking and refining existing theories. 

Multivariate, factor analytic experiment has been successful in confirming
clinical structures at the adult and middle childhood levels, but a fear
that personality structure might be too fluid and subtle at the four year level seems
hitherto to have deterred systematic research there. However, the present writers
have shown that parent and nursery school teacher behavior ratings are
structured by factor analysis with considerable certainty and uniqueness, and yield
patterns which seem to be recognizably the precursors of the ego strength dimension,
the schizothyme temperament, surgency-desurgency, etc., at later ages. Work in
this area has been further encouraged by the findings of reliable patterns in verbal
questionnaire responses and in time samplings of actual behavior recorded in the
playground situation.

However, if these structures are to become the pivots of experimental work on 
child personality development, it is necessary to shift their measurement from cumb- 
ersome rating procedures on to a basis of easily obtainable objective tests, and it is 
with the development of such tests that the present contribution is concerned. By
objective tests we mean not merely score-objective, “conspective” tests as in a
questionnaire, but, rather, that variety of situational, misperceptive (“projective”),
stylistic, etc., tests as defined elsewhere under “objective”. Parenthetically, the
present study is part of a larger, long-term research exploring personality structure
by objective means through all three media of observation (life behavior in situ,
questionnaire and objective test) and at all age levels, an interim integration of
which has recently appeared.

Obviously much is going to turn, in the present endeavor, on getting tests which
reach out into the full space of the personality sphere. The invention and choice of
tests at this level has benefited from experience at other age levels and followed the
same canons, namely: (a) to include a sufficiency of unchanged tests, to mark (as
salients) the factors found in the preceding, coordinated studies; and (b)
to shape new tests to get at the forms of response already indicated, in the behavior
rating and questionnaire factorizations at this age, to be important in covering
the personality sphere comprehensively. The first is most important in the initial
exploration at a new age level, for, unless older, known factors can be “placed”, there is
confusion. But, once they have been so placed, new test invention can proceed, in order
to yield, in later experiments, higher validities (loadings) for the now recognized and
confirmed patterns.

With such a basis of test selection, our design became: (a) to administer the
battery to a sufficient sample of 4-5 year olds, (b) to discover the number and nature
of the factors accounting for the correlations, by pursuing simple structure, if found
inherent in the data, blindly (i.e., by statistical criteria alone) and uninfluenced by
theoretical preconceptions, and (c) to develop sufficient precision of hypotheses on
these factors, e.g., through interpretation in terms of factors known at other ages, to
make a basis for further checking researches. The same children tested here on
objective tests have also been subjects for factoring on ratings (including time
sampling) and on an oral questionnaire, and the resulting factors from the
three media are later to be brought together.
THE NATURE OF THE PERSONALITY TESTS

As just indicated, it was our intention to represent each personality factor
hitherto known, and specifically to represent each factor, old or newly hypothesized,
by at least two marker tests, as close as possible to the tests used at older levels, i.e.,
these particular tests should be of a kind which one would not expect to be modified
very much in going down the scale from eleven and seven to four years of age. By
this means, after rotation, we might hope to identify the factors with the factors of
the upper age level. In addition to these markers, a number of entirely new tests
were invented, especially adapted to the four year old level and concerned with
behavior which might clearly bring out the meaning of certain factors. The resulting
list of variables is given in Table 1.¹


TABLE 1. LIST OF OBJECTIVE TEST VARIABLES


The following 81 variables are indexed in order of appearance in the correlation and factor
matrices, as well as with a Master Index (M.I.) number permitting their location. For full particulars
of administration and scoring, see the Test Compendium, the O-A Battery and other articles and
[...] describing the tests. The verbal title is kept as close to the habitual title in laboratory
publications as possible (allowing for slight changes in descriptive emphasis in referring as [...]
as possible to the child form), and the adjective at the right indicates the addition for sign
[...]ng in the factor salient tables.

The division into “main variables” and “secondary variables” is of no major importance, but the
former were less “speculative” and the latter were often only modifications of scoring of earlier tests
deemed more essential to defining factor structure.


No. of variable in factor matrix | Master Index number | Main variables: title to be given to positive loadings | High test score / Low test score | Reliability

1 | M.I. 176 | Regularly warned reaction time (R.T.) | Long / Short |
2 | | Irregularly warned reaction time (R.T.) | Long / Short |
3 | | Reactions to false signals (R.T.) | Many / Few |
4 | | Ratio of irregular to regular warned reaction time | Long / Short |
5 | | Variance of simple reaction time | Large / Small |
6 | | Complex decision reaction time | Long & accurate / Short & inaccurate |
7 | | Errors of decision in maintaining complex reaction time sets | Many / Few |
8 | | Ratio of complex to simple reaction time | Large / Small |
9 | | Speed of decision in visual comparisons | Short / Long |
10 | | Tempo of arm-shoulder movement | Many / Few |
11 | | Objects seen in unstructured drawings | Many / Few |
12 | | (Absolute number) threatening objects in unstructured drawings | Many / Few |
13 | | Percentage threatening objects perceived in unstructured drawings | Many / Few |
14 | | Speed of social judgment | Short / Long |
15 | | Willingness to play practical jokes and tease | Many jokes / Few jokes |
16 | | Questionable preferences in reading | Many emotional / Few emotional |
17 | | Use of circles and slanting lines equivalent on C.M.S. Test (Child version) | Many / Few |
18 | | Ratio incorrect (forbidden) to total number on CMS performance (Child version) | Many correct / Few correct | 81
19 | | Total (correct) score on CMS (modified) (Child version) | Many correct / Few correct |
20 | M.I. 30 | Ratio self criticism to criticism of others | Many self criticisms / Many criticisms of others |
21 | | Criticism of performances | Many criticisms / Few criticisms |
22 | | Accuracy of perceptual Gestalt completion | High percentage correct / Low percentage correct |
23 | | Speed of perceptual Gestalt completion | Long time / Short time |
24 | | Index of carefulness | Few correct / Many correct |
25 | | Analytic performance on Gottschaldt figures | Many correct / Few correct |
26 | | Critical exactness | High percentage correct / Low percentage correct |
27 | | Tendency to agree | High percentage / Low percentage |
28 | | Willingness to admit being naughty (common frailties) | Many confessed / Few confessed |
29 | | Tendency to recall consonant vs. dissonant statements | High ratio consonant / High ratio dissonant |
30 | | Willingness to forego prize for those in need | Will keep many rewards / Gives up rewards |
31 | M.I. 150 | Extremity of viewpoint | Many extreme categories / Many middle categories |
32 | M.I. 34 | Maturity of opinion | Many inconsistent responses / |
33 | M.I. 271 | Fluency | Many responses / Few responses |
34 | M.I. 6:128 | Ideomotor speed (color naming) | Many named / Few named |
35 | M.I. 2 | Rigidity (reversed color naming) | Reversed score low / Regular score low |
36 | | Rigidity (mirror drawing) | |
37 | | Performance on mirror drawing | Long time per unit score / Short time per unit score |
38 | M.I. 227 | Preference for color/form in pictures | Many color / Many form |
39 | | Ratio of inaccuracy to speed (pencil maze) | |
40 | M.I. 334 | Ratio inaccuracy (to speed) on difficult to easy mazes | |
41 | M.I. 144 | Acceptance of moral aphorisms (platitudes) | Agrees with many / Agrees with few |
42 | | Readiness to learn new names | Many correct / Few correct |
43 | M.I. 211 | Annoyability (all situations) | Many annoyances (& moderate) / Few annoyances |
44 | M.I. 330 | Ethical choices | Many / Few |
45 | M.I. 47 | Expansiveness of copied design | Large drawing / Small drawing |
46 | | Speed of drawing | Short time / Long time |
47 | | Speed on repetitive formboard performance (realistic manipulation) | Short time / Long time |
48 | M.I. 199 | Computational skill | Many correct / Few correct |
49 | M.I. 273 | Fluency on self (absolute, not relative) | Many items / Few items |
50 | M.I. 161a | No. correct performances under approval relative to disapproval (comp. of lines) | High approval / High disapproval |
51 | M.I. 343 | Rapid performance on approval relative to disapproval (formboard) | High approval / High disapproval |
52 | M.I. 343a | Faster performance under approval relative to disapproval (pencil mazes) | High approval / High disapproval |
53 | | Higher accuracy to speed under approval vs. disapproval (mazes) | High approval / High disapproval |


¹The originals of these tests may be obtained for research purposes on application to the authors.
TABLE 1. LIST OF OBJECTIVE TEST VARIABLES (continued)

No. of variable in factor matrix | Master Index number | Main variables: title to be given to positive loadings | High test score / Low test score | Reliability

54 | M.I. 205 | Emotionality of comment | Many emotional comments / Few emotional comments | 08
55 | M.I. 97:123 | Ratio passing time estimate, working in waiting | High waiting / High working | 63
56 | M.I. 212 | Distant goals considered attainable | Few / Many | 67
57 | M.I. 268 | Tempo of tapping | Many taps / Few taps | 65
58 | M.I. 31 | Fluctuation of attitudes | Many changes / Few changes | 44
59 | M.I. 344 | Tendency to like people | Many likes / Few likes | 19
60 | M.I. 345 | Prolonged will-endurance (momentum) | Long time / Short time | 93
61 | M.I. 346 | Will-endurance (increase in concentration) | Increase in time from I-II / Decrease in time from I-II | 41
62 | M.I. 347 | Readiness to imitate | Many imitations / Few imitations | 79
63 | M.I. 348 | Critical rejection-vs-uncritical acceptance of generalizations | Many rejections / Few rejections | 26
64 | M.I. 266 | Ratio of initial to final performance | Much improvement / Little improvement | 26
65 | M.I. 101 | Absolute level of aspiration | High / Low | 06
66 | M.I. 349 | Consistency of aspiration | High & consistent / Low | 61

Secondary Variables

67 | M.I. 176:263:40 | Slowness (vs. instantaneousness) in false reaction times | Long for few / Short for many | 44
68 | M.I. 72 | Scatter in complex reaction times | High standard deviation / Low standard deviation | 69
69 | M.I. 350 | Total errors in shape comparisons | Many / Few | 66
70 | M.I. 351 | Shape comparison performance (errors and speed) | Many errors / Few errors |
71 | M.I. 353 | Standard deviation in tempo of arm-shoulder movement | High / Low | 09
72 | M.I. 352 | Standard deviation (unevenness in) number of objects seen in unstructured drawings | High / Low | 27
73 | M.I. 354 | Possible responses missed on C.M.S. (Child version) | Many unmarked / Few unmarked | 57
74 | M.I. 355 | Caution (“Don’t know” responses) in Gestalt completion | Many “don’t know” / Few “don’t know” | 44
75 | M.I. 67 | Memory for consonant statements (absolute no.) | High consonant / Low consonant | 31
76 | M.I. 356 | Errors in color naming | Many / Few | 67
77 | M.I. 356a | Ideomotor speed: correct only | Many correct / Few correct | 82
78 | M.I. 2 | Rigidity: color naming (corrected for speed) | Many correct / Few correct | 31
79 | M.I. 357 | Venturesomeness in mirror drawing | Long distance covered / Short distance | 31
80 | M.I. 359 | Annoyability (extremity of response) | Very annoyed / Little annoyed | 68
81 | M.I. 358 | High performance on comparison of lines | Many correct / Few correct | 88


CONDITIONS OF THE EXPERIMENT 
Due to the excellent and understanding cooperation of the University of
Chicago Nursery School, for which we particularly wish to thank Mrs. Frances
Prindle and her staff, it was possible to obtain four hours of testing (eight half-hour
individual sessions within a couple of weeks) with each of 80 children. The choice of
children was kept to as narrow an age range as possible, since we did not wish the
factor picture to be complicated through different maturation factors being created
by the age variance. Both boys and girls were taken, 36 of the former, 46 of the
latter, since we wanted to get factors general to the sexes. The children averaged 4.7
years of age, with a sigma of 0.4. They were somewhat selected for socio-economic 
status of parents, and, according to records, averaged well above the mean intelli- 
gence of the general community. However, they were mixed to the extent that there 
was a proportion of colored children among them, and, in personality, there seems no 
reason to suppose that any radical selection, of a kind that would completely ob- 
literate a factor, can arise through social status or intelligence selection of a moderate 
kind. 

Two experimenters carried out all the testing, about half the children being
tested by each. They made themselves acquainted with the children before the test
period, being several weeks on the premises, and the children were in all cases happy
to go with them to the test room, where they were put through the series of tests
under the description of “games”. The total test session was divided up for each
child into eight parts, commonly occurring on different days (but not remote, to
avoid function fluctuation effects), with each part of about 30 minutes' duration. All
of the objective tests were given in an individual test situation and in the same room.
Wherever possible the tests were so arranged that each score could be split into two 
halves, similar to a split-half consistency coefficient, so that their reliabilities could 
be determined and allowed for in interpreting the subsequent factor structure. 
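The split-half computation just described may be sketched as follows. This is an illustration only: the half-scores are invented, and the Spearman-Brown step-up to full test length is an assumption on our part, since the text does not say whether the reported reliabilities were so corrected.

```python
import numpy as np

def split_half_reliability(half_a, half_b, step_up=True):
    """Correlate the two half-scores of one test across children.

    With step_up=True the half-test correlation is stepped up to full
    length by the Spearman-Brown formula (an assumed, optional step).
    """
    a = np.asarray(half_a, dtype=float)
    b = np.asarray(half_b, dtype=float)
    r_half = np.corrcoef(a, b)[0, 1]   # product-moment r between halves
    if not step_up:
        return r_half
    return 2.0 * r_half / (1.0 + r_half)

# Hypothetical half-scores for eight children on one brief test.
first_half = [3, 5, 4, 6, 2, 7, 5, 4]
second_half = [4, 5, 3, 7, 2, 6, 6, 3]
print(round(split_half_reliability(first_half, second_half), 2))
```

A very brief test yields few items per half, so the coefficient obtained in this way is itself subject to large sampling error.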

In discussing these reliabilities it must be realized that we have deliberately 
entered this experiment with very brief tests (as we did throughout the whole re- 
search program) in order to find out the more promising tests by an initial broad 
“blocking in” of the factors. This permitted us to get 43 different tests, yielding
81 different behavioral scores, into the available 4 hour total testing, by allowing
only an average of 5 minutes per test. It is proposed hereafter to carry out, as we
have done with the adult group, an improvement of the reliability of those tests
having encouraging factor validity, by item analysis and lengthening of the test.
However, for purposes of finding factor structure, reduction of loading by moderately
low reliabilities is unimportant. Such reduced reliabilities lower the loadings on most
factors, “across the board”, but still reveal the essential factor patterns, which are the real
desideratum at this stage.
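The “across the board” effect can be illustrated with the classical attenuation relation, in which an observed loading equals the loading of the error-free score multiplied by the square root of the test's reliability. The sketch below assumes that relation and uses invented loadings; `attenuated_loading` is a hypothetical helper, not part of the original analysis.

```python
import math

def attenuated_loading(true_loading, reliability):
    """Loading expected for an unreliable test (classical attenuation)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return true_loading * math.sqrt(reliability)

# One hypothetical test with error-free loadings on three factors.
true_row = [0.60, -0.40, 0.05]
observed_row = [attenuated_loading(v, 0.25) for v in true_row]
print([round(v, 3) for v in observed_row])   # every loading is halved
```

Every loading of the test shrinks by the same multiplier, so the signs and the relative sizes of its loadings across factors are unchanged; only the absolute level falls.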


THE FACTOR ANALYTIC PROCEDURE

The test scores were intercorrelated by the product-moment formula and the 81
by 81 correlation matrix was then factor analysed by the centroid method. Tucker’s
criterion and the new criterion worked out by Scheier were tried in an attempt
to decide exactly how many factors should be extracted. They pointed respectively
to 17 and 19 factors. So, following the usual principle of extracting one or two factors
in excess of the indicated number, and rotating them out thereafter, we entered the
rotation process with 20 factors.
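For readers unfamiliar with centroid extraction, a minimal sketch follows. It is a simplified textbook form and not the routine used in this study: the diagonal entries are taken as given (no communality estimation or re-estimation between factors), and the sign-reflection rule is a simple greedy one. The function name `centroid_factors` and the demonstration data are hypothetical.

```python
import numpy as np

def centroid_factors(R, n_factors, max_flips=None):
    """Extract centroid factors from a correlation matrix R.

    For each factor, variables are reflected until no column sum is
    negative; loadings are column sums divided by the square root of
    the grand total; the factor's product matrix is then removed.
    """
    resid = np.array(R, dtype=float)
    n = resid.shape[0]
    if max_flips is None:
        max_flips = 10 * n
    loadings = np.zeros((n, n_factors))
    for k in range(n_factors):
        signs = np.ones(n)
        for _ in range(max_flips):
            col_sums = (resid * np.outer(signs, signs)).sum(axis=0)
            j = int(np.argmin(col_sums))
            if col_sums[j] >= 0:
                break
            signs[j] = -signs[j]          # reflect the worst variable
        reflected = resid * np.outer(signs, signs)
        total = reflected.sum()
        if total <= 1e-12:                # nothing left to extract
            break
        a = signs * reflected.sum(axis=0) / np.sqrt(total)
        loadings[:, k] = a
        resid = resid - np.outer(a, a)    # residual correlations
    return loadings

# Demonstration on random scores: 80 "children", 6 "tests".
rng = np.random.default_rng(0)
scores = rng.normal(size=(80, 6))
R_demo = np.corrcoef(scores, rowvar=False)   # product-moment matrix
L_demo = centroid_factors(R_demo, 2)
print(L_demo.shape)
```

The signs of a whole factor are arbitrary in this procedure, which is why a factor may emerge in reversed polarity and be reflected afterwards for ease of interpretation.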

It is well known that factor analyses that are otherwise entirely sound and
impeccable in procedure may become completely worthless through inadequately
carried out rotation to simple structure. Accordingly, a great deal of time was
spent here in searching for the best possible structure, and in testing its significance
when found. After the initial analytical oblimax rotation,² thirteen further, overall
visual, single plane, scope plot rotations were carried out, reaching at one point 67%
of variables in the ± .10 hyperplane, but dropping, with adjustment of angles, to
59%. This gives a significance of simple structure, by Bargmann’s index, of between
5% and 1%, but the significances are discussed afresh for individual factors
below.


²The oblimax analytical rotation method, developed by this laboratory in collaboration with K.
Dickman and J. R. Saunders, maximizes the fourth powers, accepting oblique positions (whereof the
orthogonal position is a permitted special case). However, experience has shown that neither quartimax
nor oblimax rotates adequately to simple structure, though the latter is better. The analytical
methods bring the rotation to a point where one can more readily see, in visual plots, what hyperplanes
exist and what moves are necessary to converge upon them, but oblimax initially leaves the factors in
a too oblique relationship, due to a few very highly loaded variables pulling the reference vector away
from the perpendicular to its hyperplane.
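The hyperplane percentage referred to in this section is simply the proportion of loadings lying within ± .10 of zero. A minimal sketch of the count, using an invented loading matrix and a hypothetical helper `hyperplane_percentages`, is given here; it does not reproduce Bargmann's significance test, which requires his tables.

```python
import numpy as np

def hyperplane_percentages(V, width=0.10):
    """Percentage of loadings with absolute value <= width.

    Returns the percentage for each factor (column) and for the
    matrix as a whole.
    """
    V = np.asarray(V, dtype=float)
    in_plane = np.abs(V) <= width
    per_factor = 100.0 * in_plane.mean(axis=0)
    overall = 100.0 * in_plane.mean()
    return per_factor, overall

# Hypothetical 4-variable, 2-factor reference vector structure.
V_demo = [[0.05, 0.60],
          [0.50, -0.08],
          [0.02, 0.09],
          [-0.70, 0.30]]
per_factor, overall = hyperplane_percentages(V_demo)
print(per_factor, overall)
```

Each visual rotation is judged by whether it raises this count without forcing the reference vectors into excessively oblique positions.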
DESCRIPTION OF THE SOURCE TRAIT FINDINGS

As usual, we shall set out the salients for each factor, from the rotated simple
structure matrix (A.D.I. No. 5707),³ running down typically to correlations of 0.2,
which, although well clear of the hyperplane, may be taken as the lower limit of
adequate significance. However, where it is necessary to see what has happened to 
supposed markers for the factors in question, lower loadings will sometimes be taken. 
The marker variables for the factors hypothesized to be present will be written in 
italics. Factors are numbered and presented in order of declining contribution to the 
total variance of the 81 variables here studied. 
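The tabulation rule just stated (salients down to about 0.2, factors ordered by declining contribution to variance) can be sketched as below. The loading matrix and variable names are invented, and taking the sum of squared loadings as a factor's contribution is an approximation we assume for illustration, since with oblique factors contributions are not strictly additive.

```python
import numpy as np

def salients(loadings, factor, names, cutoff=0.20):
    """Variables loading at least `cutoff` (absolutely) on one factor,
    listed in order of declining absolute loading."""
    col = np.asarray(loadings, dtype=float)[:, factor]
    order = np.argsort(-np.abs(col))
    return [(names[i], float(col[i])) for i in order if abs(col[i]) >= cutoff]

def order_by_variance(loadings):
    """Factor indices ordered by declining sum of squared loadings."""
    L = np.asarray(loadings, dtype=float)
    return [int(i) for i in np.argsort(-(L ** 2).sum(axis=0))]

# Hypothetical 3-variable, 2-factor matrix.
demo_loadings = [[0.60, 0.10],
                 [-0.25, 0.50],
                 [0.05, -0.30]]
demo_names = ["test A", "test B", "test C"]
print(salients(demo_loadings, 0, demo_names))
print(order_by_variance(demo_loadings))
```

Marker variables falling below the cutoff would have to be looked up separately, as is done in the text where a hypothesized marker fails to load.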


TABLE 2. FACTOR 1: GENERAL INTELLIGENCE*

Variable  Description                                                          Loading
          Correct and valid shape comparison decisions                         -62
          Few errors in shape comparison decisions                             -61
          Low tendency to agree                                                -40
          Good performance on comparison of lines                              +39
          High index of carefulness                                            +37
          Quick learning of new names                                           37
          High ratio of irregular to regular warned reaction time
          Little critical exactness in correcting account of own performance
34        High ideomotor speed (color naming)
25        Good analysis on Gottschaldt figures


*Note: This factor has been reversed relative to the matrix, for simplicity of interpretation.


This factor loads all the tests which would be expected to have Spearman’s
“g” in them, and also has the pervasive, large variance that one would expect of
general intelligence in a battery of fairly complex tests applied to very young
children. Its identification is confirmed by a correlation of -.8 (the factor happened to
emerge here in reversed polarity) with the Children’s Binet mental ages, a correlation 
not approached by any other factor. We set it aside here before proceeding with the 
personality factors proper. 


TABLE 3. FACTOR 2: WILLED-RESPONSIVENESS U.I. 29

Variable  Description                                                                       Loading
2         Fast irregularly warned reaction time                                             -66
8         High ratio of complex to simple reaction time                                     +58
1         Fast regularly warned reaction time
          Low ratio incorrect to total responses on CMS performance (Child version)         -48
          No excessive improper use of circles and slanting lines (equivalent) on CMS
            (Child version)
          High total (correct) score on CMS (Child version)
          Decline of accuracy to speed ratio, under approval, relative to disapproval
          High speed of perceptual Gestalt completion
          Low variance of simple reaction time
          Low absolute level of aspiration
          High ideomotor speed in color naming: correct only
          High ideomotor speed (color naming)
          Many reactions to false R.T. signal
          Low rigidity (reversed color naming)


*Reported for completeness, but of low reliability.
Markers for U.I. 29 in italics.

³The rotated reference vector matrix, V13, and the angles among the reference vectors, C13, are
available, along with the unrotated factor matrix V0 and the transformation matrix T13, on microfilm
from ADI under the number 5707. Send $1.25 for photostat or $1.25 for microfilm.
This pattern is comparable with U. I. 22 and U. I. 29 and no other, in the known
adult and 11 year old series. In spite of the very heavy loading in simple reaction time,
which normally characterizes U. I. 22, it is identified as U. I. 29, because the rest of
the pattern of U. I. 29 is here, and because the next factor fits U. I. 22, though with
somewhat lower absolute loadings on reaction time.

Variables 19, 3, 35, 34, and 77 have salient loadings, i.e., loadings out of the
hyperplane, here as required by the U. I. 29, over-responsiveness pattern. Speed of
talking, present in the adult U. I. 29, fails here, while 4 and 50 fall in the hyperplane
for the obvious reason that they have almost zero reliability. The general character
of “keyed up” behavior persists in the new variables, but here one might question
“over-responsiveness”, since the performances are almost all in the direction of
efficiency. Along with the GSR and metabolism associations in the adult patterns,
one gets still, however, a concept of eagerness and capacity to mobilize one’s
resources. This is almost certainly a motivational source trait, most probably, as
proposed elsewhere, connected with over-compensatory behavior for inferiority or
insecurity.


TABLE 4. FACTOR 3: CORTICALERTIA U.I. 22

Variable  Description                                                                 Loading
          Low variability in complex reaction time                                    -68
          Fast complex reaction time                                                  -56
          High speed of repetitive formboard performance (realistic manipulation)     +56
          Low variance of simple reaction time                                         56
          Low ratio of inaccuracy to speed (pencil maze)
          High fluency on self (absolute, not relative)
          Fast regularly warned reaction time
          Fast irregularly warned reaction time
          Many errors in maintaining complex reaction time sets
          Many reactions to false R.T. signals
          Rapid performance on approval relative to disapproval
          High ideomotor speed (color naming)
          Many objects seen in unstructured drawings
          Low dynamic momentum (will endurance)


All the markers for U. I. 22 appear here (and nowhere else except, in very
incomplete form, in Factor 2, above) with correct signs and mainly with substantial
loadings. It continues at the 4-5 year level to have the quality defined in adults as
Corticalertia, namely, a high basic neural speed, with low variability, but a
tendency to “go off at half cock”. The riddle of its distinction from U. I. 29 remains.
Both are largely environmentally determined patterns, as to level, and both show
a dynamic quality of eagerness and response to social approval, etc. The hyperplane
fit falls below the P = .05 level here, but this is characteristically a pervasive factor.


TABLE 5. FACTOR 4: REALISM vs. SUBJECTIVITY U.I. 25

Variable  Description                                                          Loading
          High ideomotor speed (color naming)
          High ideomotor speed in color naming (correct only)
          High rigidity (reversed color naming)
          Low fluency on self (absolute, not relative)
          High fluency
          High willingness to forego prize for those in need
          Low speed of drawing
          Low fluctuation of attitude
          Low speed of social judgment
          Low extremity of viewpoint
          Few “decisions” (child equivalent) missed on C.M.S.
          High computational skill (Child version: comparison of quantities)
          Low accuracy of perceptual Gestalt completion
          [...] ratio passing time estimate, occupied to idle
This is the only factor in the series with definite marker resemblance to U. I. 25,
Realism, though it holds only for 34, 48, 22, and 55, the other required salients, 23,
15, and 57, falling in the hyperplane. The curious mixture of high speed on realistic,
exact performances, e.g., computation, with rigidity in more creative or, at least,
subjective response, e.g., drawing, fluency on self, Gestalt completion, social
judgment, low attitude fluctuation, remains, and agrees with the Realism-vs.-Subjectivity
(Eysenck “psychoticism”) interpretation of U. I. 25. However, both this factor
meaning and the hyperplane are not well defined.


TABLE 6. FACTOR 5: ANXIETY U.I. 24

Variable  Description                                                                Loading
          High annoyability (all situations)                                          79
          Many “extreme” annoyance responses                                          76
          Willingness to admit being naughty (common frailties)                       28
          Low speed of repetitive formboard performance (realistic manipulation)      27
          Low extremity of viewpoint                                                  26
          Slow complex decision reaction time                                        +26
          Low speed of decision in visual comparisons                                 23
          High readiness to learn new names                                           20
          High emotionality of comment                                               +16
          Low fluctuation of attitudes
          High tendency to agree

The now well established Anxiety, U. I. 24 pattern has resemblance
initially here to factors 5 and 14, but there are good reasons for preferring the former,
despite variable 16 loading in the wrong direction (discountable in view of its
essentially zero reliability). All remaining markers have the right sign, though some are
of very small magnitude compared with the adult pattern, and indeed “Tendency to
Agree” virtually fails. Further experiment might be done with rotations of this
factor. Variable 42, High Readiness to Learn, perhaps is the forerunner of the Good
Ethical Choice and Absence of Questionable Reading Preferences which load the
factor in later life. An objective anxiety battery for pre-school children has been
added, by present results, to those available at higher ages.


TABLE 7. FACTOR 6: NEURAL RESERVES-VS.-NEUROTICISM U.I. 23

Variable  Description                                                                         Loading
          High liability to error (relative speed) in “creative effort” (Rigidity function)   -63
          Many errors in rapid color naming
          Low absolute level of aspiration
          Low preference for color over form in pictures
          High variability in number of objects seen in unstructured drawings
          Low rigidity (reversed color naming)
          Low ratio of inaccuracy to speed (pencil maze)
          Low speed on repetitive formboard performance (realistic manipulation)
          Low speed of drawing
          Slow performance on approval relative to disapproval (formboard)
          High will-endurance (second trial)
          High computational skill (Child version: comparison of quantities)
          High fluency

Factors with any resemblance to Neural Reserves-vs.-Neuroticism, U. I. 23,
are 10, 19, and the present pattern; but this alone clearly has the main markers.
However, “Low absolute level of aspiration” must be discounted because of its low
reliability, and “High fluency”, though on the right side of the hyperplane, is
negligibly loaded. It is interesting that the child scoring highest in the non-neurotic
direction is most prone to careless errors in color naming and to low aspiration. The
hyperplane is at the P = .05 level.
TABLE 8. FACTOR 7: HARRIC ASSERTIVENESS: U.I. 16

Variable  Description                                                          Loading
          High absolute level of aspiration                                    +56
          Many distant goals considered attainable                             -47
          High consistency of aspiration                                        37
          Much acceptance of obvious moral aphorisms (platitudes)
          Low tendency to agree
          Increased will-endurance (second trial)
          Good shape comparison performance
          Little critical exactness in correcting account of own performance
          Fast tempo of tapping
          Fast complex reaction time
          Tendency to like people
          Good analysis performance on Gottschaldt figures
          High speed of decision in visual comparisons
          Low ratio self criticism to criticism of others
          High ratio of inaccuracy to speed (pencil maze)
          Good ethical choices


Possible rival pattern matches for U. I. 16 were factors 4 and 6. It is
noteworthy that this largest variance personality factor, U. I. 16 in adults, is large here
also. A few possible adult markers (11, 17, 19, 35, 77, and 81), mainly concerned with
willed speed (and contact), fail to load here, and it is brought out more clearly than
in the adult that the factor is concerned more with tempo than with speed. However,
there is the same central hypothesized character of “expressiveness and
assertiveness, with fast tempo,” here showing in high aspiration, will endurance,
little tendency to agree, and both kinds of tempo. There are also associations fitting
the correlations found with the 16 PF questionnaire factor of Harria. Finally
there is the usual emphasis (6) on good social values (Variables 41 and 44).


TABLE 9. FACTOR 8: STOLIDITY, COOLNESS IN STRESS: U.I. 30

Variable  Description                                                                Loading
          Low ratio incorrect (forbidden) to total responses on CMS performance
            (Child version)                                                          -51
          No excessive use of circles and slanting lines (equivalent) on CMS
            (Child version)                                                          -47
          [...] objects seen in unstructured drawings                                 43
          [...] (correct) score on CMS (Child version)
          [...] in complex reaction times                                             40
          [...] will-endurance (dynamic momentum, physical str[...])                 -39
          Few threatening objects in unstructured drawings
          Low readiness to mimic                                                     -28
          Slow tempo of arm-shoulder movement
          Low fluctuation of attitude                                                 13


This is the only factor containing the markers for U. I. 30 with the right sign
combination; but its identification must be tentative because it also contains
variables, notably the C.M.S., not previously found to load U. I. 30. Its sense fits the
“stolidity” title assigned to U. I. 30 and suggests steadiness, and coolness in
stressful situations.


TABLE 10. FACTOR 9: AUTIA-VS.-PRAXERNIA: U. I. 34

Variable  Description                                                          Loading
          [...] of decision in visual comparisons
          [...] decision reaction time
          [...] (absolute no.) threatening objects in unstructured drawings
          [...] preference for color over form in pictures
          Low ratio of complex to simple reaction time
          Many objects seen in unstructured drawings
          High willingness to forego prize for those in need
          High fluency
          Good shape comparison decisions
          Much criticism of performances (own and others)
          Many “extreme” annoyance responses
          Many questionable preferences in reading
          Many errors of decision in maintaining complex reaction time sets

This is a very good match for the U. I. 34 pattern which Rosenthal located,
and Scheier confirmed, since the original survey of replicated factors. The
character of the present new, non-marker variables is, moreover, fully consistent
with the identification as M, autia, made by Scheier, namely, imaginative,
fluent, quick, but fearful, easily critical and introverted. It is increasingly apparent
that M, autia, despite its late discovery, is a personality factor of considerable
stability of form, and practical importance.

TABLE 11. FACTOR 10: CRITICAL RESTRAINT (NEW). POSSIBLY U.I. 26

Variable  Description                                                                Loading
          High accuracy of perceptual Gestalt completion                             +55
          Caution (“Don’t know” responses) in Gestalt completion                     +55
          [...] ratio self criticism to criticism of others                          +38
          Few correct performances under approval relative to disapproval
            (comp. of lines)                                                          32
          Increased will-endurance (second trial)                                    +31
          High emotionality of comment                                                28
          Low variance of simple reaction time                                        24
          Few errors in shape comparison decisions                                    24
          Tendency to dislike people (in portraits)
          Few errors of decision in maintaining complex reaction time sets           -21


This has some resemblance to U. I. 17, 23, and 29, but most to U. I. 26. How- 
ever, the agreement is not good enough to conclude any definite match and it is 
tentatively called, descriptively, Critical Restraint, and left to future verification as a 
pattern (incidentally, with a better hyperplane than here) before theorizing. This 
is at least highly consistent in its variables, depicting strong will, exactitude, 
criticism of self and others, and cautiousness. Its resemblance to the sense of U. I. 17 is 
great, yet the markers for U. I. 17 fall in Factor 12, so we must hypothesize two 
sources of inhibition. 


TABLE 12. FACTOR 11: CRITICAL PRACTICALITY: U.I. 19 


Description Loading 


Low percentage threatening objects perceived in unstructured drawings -51 
Few errors of decision in maintaining complex reaction time sets -4h 
High accuracy of perceptual Gestalt completion +42 
High extremity of viewpoint +42 
Few reactions to false R.T. signal 

Few errors in rapid color naming 

High index of carefulness (Picture recognition) 

High variability in complex reaction times 

Few errors in shape comparison decisions 

Low trial and error activity in mirror drawing 

Good analytic performance on Gottschaldt figures 

Low ratio self criticism to criticism of others 





This is an excellent match for adult and 11-year-old Critical Practicability, 
U. I. 19, having all definite markers equated and even less exact markers, e.g., low 
tendency to imitate, few slanting lines in CMS, and low rigidity, correct in sign. 
The new variables bear out the character of accuracy and penetration, with outward 
direction of aggression. 


TABLE 13. FACTOR 12: INHIBITION: U.I. 17 


Description Loading 


Willingness to play practical jokes and tease 
High variability in no. of objects seen in unstructured drawings 
Slow speed of perceptual Gestalt completion 
Preference for form over color in pictures 
Low variance of simple reaction time 
Low variability in tempo of arm-shoulder movement 
Many objects seen in unstructured drawings 
Many (absolute no.) threatening objects in unstructured drawings 
Little criticism of performances (own and others) 
Few questionable preferences in reading 


tel 
High 


This, and this alone, adequately carries the markers for U. I. 17, though, as 
usual, a slight cooperativeness is seen with U. I. 26 (Factor 19 here). Only the 
slowing of reaction time with complex instructions fails to load sufficiently, though the 
difference is in the right direction. The new variables mainly bring in a low 
variability in speed and tempo. However, the emphasis has changed from that among 
adults. Thus variable 15, which hypothetically indicates an outlet for the highly 
inhibited (see Oliver Cromwell), comes higher, while 13 (as found in 11-year-olds) 
comes lower in the pattern. The hyperplane significance is at P = .01. 


TABLE 14. FACTOR 13: MIRROR-D 


Variable Description Loading 


37 Slow and defective performance on mirror drawing +70 
36 Poor final achievement (orientation rigidity) in mirror drawing 
61 Increased will-endurance (second trial) 
16 Few questionable preferences in reading 
4 Low emotionality of comment 


No match can be found for this, which is scarcely surprising, seeing that most of 
its variance arises artifactually, as a doublet from two ways of scoring the mirror 
drawing behavior, but its hyperplane is sound (P .01). 


TABLE 15. FACTOR 14: COMENTION-VS.-ABCULTION: U.I. 20 


Description 


1} 


ally to reject loose ge 

lo agree 

» admit being naughty (con 
ry in Gottschalat hg 
lurance (second trial 

lling to play practical jokes a 

criticism of performances (own 

irm-shoulder 
jects seen in u 


unstru 


With the highest of all loadings on 27 and 28, the two highest markers for U. I. 20 
(Maturity of Opinion had zero reliability here), there can be little doubt in 
identifying this as U. I. 20, though the hyperplane is relatively poor (52%). Yet there 
exists a not previously noted cooperativeness with U. I. 30 (slight and on two 
variables) and resemblance thereto in terms of “stolidity” and “independence.” The 
main meaning remains in accordance with our hypothesis of a general cooperativeness 
and tendency to accept, and impose on the self, good cultural standards. 
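
The hyperplane percentage quoted above can be read as the share of a factor's loadings that fall in a narrow band around zero. The following is a minimal sketch of that count, assuming a band of ±.10 (a common convention that this section does not state) and using purely hypothetical loadings; the function name `hyperplane_percentage` is illustrative only.

```python
def hyperplane_percentage(loadings, band=0.10):
    """Return the percent of loadings whose absolute value is at most `band`.

    The default band of 0.10 is an assumption, not a value taken from the study.
    """
    if not loadings:
        raise ValueError("loadings must be non-empty")
    in_plane = sum(1 for x in loadings if abs(x) <= band)
    return 100.0 * in_plane / len(loadings)


# Hypothetical loadings for one factor (not data from the study).
example_loadings = [0.49, -0.03, 0.08, -0.37, 0.02, 0.10, -0.55, 0.00]
print(f"{hyperplane_percentage(example_loadings):.1f}% of loadings lie in the hyperplane")
```

A higher percentage means more variables are unaffected by the factor, which is the sense in which a hyperplane is called good or poor in the text.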


TABLE 16. FACTOR 15: EXUBERANCE: U.I. 21 


Variable Description Loading 


High speed of social judgment +49 
Tendency to recall dissonant rather than consonant statements 48 
High speed of perceptual Gestalt completion 16 
High fluency +37 
High speed of decision in visual comparisons 30 
Many errors in shape comparison decisions 
Many agreed responses remembered 
Fast tempo of tapping 
High readiness to mimic 
Many “extreme” annoyance responses 
High ideomotor speed (color naming) 
High tendency to agree 





Unfortunately, the highest marker for U. I. 21 (No. 50 here) had to be rejected 
(Table 1) for low reliability; but otherwise the markers for U. I. 21 appear very con- 
sistently here (and nowhere else, except for slight resemblance to Factor 4). (Compu- 
tational skill is positive, but falls into the hyperplane. Its failure is presumably our 
failure to construct a 4 year old equivalent for computation.) The descriptive 
hypothesis of “exuberance” is continued in the new variables, e.g., many 
errors, readiness to mimic, extremity in statements of annoyance. 


TABLE 17. FACTOR 16: EXAVIA-VS.-INVIA: U.I. 32 


Variable Description Loading 





18 High computational skill (Child version: comparison of quantities) +45 
62 High readiness to mimic -37 
67 Tendency of false reactions to occur quickly 

Little consistency of aspiration 

Slow performance on approval relative to disapproval (pencil maze) 

High fluency on self (absolute, not relative) 

Low extremity of viewpoint 

Accurate and fast performance on comparison of lengths 

Low caution (“Don’t Know” responses) in Gestalt completion 

High ratio of irregular to regular warned reaction time 

Low ratio self criticism to criticism of others 

Many objects seen in unstructured drawings 

No subjective, excessive use of circles and slanting lines (equivalent) on 
CMS test (Child version) 
Low accuracy of perceptual Gestalt completion 








Four markers existed in this experiment for U. I. 32, now firmly identified as 
what is commonly called extraversion-vs.-introversion, a second order rating and 
questionnaire factor, ‘ »?- 16. 17), They fall here consistently but with low loadings, 
as shown, in a factor which has rather surprisingly small variance. Its hyperplane is 
also poor. Presumably, the large change in age has brought a reduced projection, 
while recognizably retaining the pattern, and the measurement of the exavia 
factor will need to be transferred here to new variables. The new variables, however, 
fit very well the notion of attention to external reality, readiness to mimic, lack of 
caution, and self criticism, and impulsiveness and lack of consistency, in 
extraversion. 










TABLE 18. FACTOR 17: U.I. 35 
Variable Description Loading 


55 Low ratio passing time estimate occupied in idle periods -48 
58 Low fluctuation of attitude -39 
18 Low computational skill (Child version: comparison of quantities) -33 
10 Low increase in accuracy (to speed) on passing from easy to difficult maze -33 
81 Accurate and fast performance on comparison of lines +30 
26 Little critical exactness in correcting account of our performance -28 
73 Many “decisions” (child equivalent) missed on CMS +28 
36 Poor final achievement (orientation rigidity) in mirror drawing +25 

$ High ratio of irregular to regular warned reaction time +19 
43 High annoyability (all situations) +15 
49 High fluency on self (absolute, not relative) +10 


The pattern of U. I. 35 has only been recognized and replicated, in adults, re- 
cently by Scheier. Three of the four markers can be accepted as replicating the 
pattern here in children, with reduced projection however, and therefore permitting 
us to indicate this only as a tentative pattern in children, awaiting confirmation in 
terms of the better variables here found. 

TABLE 19. FACTOR 18: ASTHENIA (WITH GOOD COMPLEX PERFORMANCE): U.I. 28 


Variable Description Loading 





Many agreed responses remembered (Consonant statements) +46 
Slow performance on approval relative to disapproval (formboard) —33 
Many reactions to false R. T. signals +32 
Fast complex decision reaction time —31 
Excessive use of circles and slanting lines (equivalent) on CMS test 
(Child version) +31 
Low speed of decision in visual comparisons 28 
Low readiness to mimic 
Low ratio of complex to simple reaction time 
High speed of drawing 
Slow tempo of tapping 
High ratio passing time estimate occupied in idle periods 
Few distant goals considered attainable 
High fluency on self (absolute, not relative) 
High annoyability (all situations) 
High tendency to agree 
Tendency to recall consonant vs. dissonant statements 


The markers, in descending order 55, 27, 43, 49, 29 for U. I. 28 appear here, and 
only here, simultaneously with loadings outside the hyperplane. Although small, all 
five are correct in sign, and if 29 is combined with 75, as might be appropriate, the 
cumulative significance is high. However, U. I. 28 remains here, as in other studies, 
a factor of low variance, poorer hyperplane and elusive meaning. (The interpretation 
favoring a combination of emotional impulsiveness with asthenic qualities, of high 
speed on some complex performances with slow tempo, etc.) It deserves special 
investigation, in terms of finding one or two substantially loaded markers to aid 
interpretation. 


TABLE 20. FACTOR 19: SELF SENTIMENT (CONTROL): U.I. 26 


Variable Description Loading 
64 Low ratio of initial to final (simple) performance +57 
35 Low rigidity (reversed color naming) 
78 Lower liability to error (rel. speed) in “creative effort” (rigidity function) 
42 Readiness to learn new names 
34 Low ideomotor speed (color naming) 
77 Low ideomotor speed in color naming (correct only) 
13 High percentage threatening objects perceived in unstructured drawings 
37 Opposite: Slow and defective performance in mirror drawing 
High will-endurance (second trial) 

This can be considered only fairly good as a match for U. I. 26, because the 
highest marker in the latter, “high ratio of fluency on self to other topics,” was not 
available as a ratio here. The match rests, therefore, on 60, 35, 13, and 64 (that order 
in original), while 7, 32, 38, 50, 54, and 75, despite having only one exception 
to correct sign pattern, are never clear of the hyperplane. If the hypothesis that 
this represents the self-sentiment is correct, one might expect rather large 
modification of the pattern between 4 and 11 years. The hyperplane fit is very good: P < .001. 

TABLE 21. FACTOR 20: U.I. 18 


Variable Description Loading 
64 Li ratio of initial to final (simple) performance + 5/ 
57 Slow tempo of tapping -33 
56 Few distant goals considered attainable +98 
| Not willing to play practical jokes and tease 26 
] High (correct) score on CMS (Child version) +y} 
3 ee rigidity (reversed color naming) -23 
74 Low caution (“Don’t Know” responses) in Gestalt completion -23 
} High expansiveness of copied design +22 
29 High ratio of consonant to dissonant opinion recalled 116 
I Excessive use of circles and slanting lines (equivalent) in CMS test 
(Child version) 115 


This is the only possible match for U. I. 18, but although the markers occur 
together, No. 29 is reversed in direction. However, the hyperplane is very significant 
and the general sense (note 56, 35, 74) fits what, for lack of better terms has been 
called the hypomanic tendency in U. I. 18. And ‘“‘expansiveness of copied design’’ 
may perhaps be counted a rough equivalent for the marker, not used in this study, 
“high speed on finger maze.” 


SUMMARY AND CONCLUSIONS 


1. Eighty-one behavioral response measures on objective tests chosen to 
mark, with suitable modification, factors known in older subjects, and covering the 
general personality sphere, have been factored on eighty 4 or 5 year olds, yielding 
twenty factors revealing a simple structure significant at the P < .05 to < .01 level. 


2. Omitting inferences from loadings on a dozen variables of low reliability, 
the loading patterns in the remainder can be matched with older child and adult 
personality factor patterns U. I. 16 through 35, leaving only U. I. 27, 31 and 33 
unmatched in the standard replicated personality factor series and only numbers 10 
and 13 unmatched in the present series. Number 13 is virtually only a doublet, but 
10 is a new factor of definite psychological consistency, called Critical Restraint. 

3. The matching is very definite in fourteen factors, in that: (a) all markers 
are present, most with non-hyperplane loading, and all with correct sign, (b) no 
truly competitive alternative match exists on either side, and (c) the new, non-marker 
variables fit the psychological sense of the factor as previously hypothesized. 

However, in four others the matching can be considered only tentative, and in 
all there is a tendency for the marker pattern to appear with reduced variance, 
probably showing greater variance loss than in previous cross validations confined 
to groups of the same age as that on which the factors were first recognized. Also, 
the general rank order of variance magnitude of the factors is only slightly related to 
that in adults. Accordingly, we may tentatively conclude that there are systematic 
changes in the factor pattern emphasis between four years of age and adulthood, 
though the number and nature of the factors stays remarkably constant. 

4. To check the identifications it is desirable that this study be independently 
repeated on a new 4-5 year old sample, but it would be valuable to do this while: (a) 
increasing the length of tests which proved deficient in reliability, (b) shifting the 
matching emphasis in determining relation to the present study to new variables 
found loaded higher here than at upper ages, (c) examining the hypotheses by 
inserting new tests aimed at the essence of present factors, and (d) observing the 
agreement of the present nursery school group modification from the adult-marker 
patterns with the modifications that appear in our study of 7 and 8 year olds, now in 
process of publication. 


The factors checked here, but especially the general anxiety factor U. I. 24, 
the general neuroticism factor U. I. 23, and the extraversion-introversion factor, 
U. I. 32, should provide a useful basis of determinate, uniquely-defined measures 
for those concerned with studies of personality development in early childhood. A 
best possible battery for twelve factors has been put together from the tests in the 
present research, and will be made available to clinical researchers on request. 
THE MEANING OF A POSITIVE SELF-CONCEPT* 
FREDERICK M. JERVIS 
University of New Hampshire 


PROBLEM 


There is general agreement, both in self-concept theory and in experimental 
evidence, that a negative self-concept indicates stress and tension within the in- 
dividual. In contrast, interpretations of the significance of a positive self-concept 
have varied. A large body of research, identified with Carl R. Rogers and the 
Chicago School, was based on the assumption that a positive self-concept indicated 
a well-adjusted individual. Both in theory and research design, this group stressed 
the importance of self-integration with the implicit assumption that the well- 
integrated individual would also be integrated into his society. 

Rogers and Dymond have recently suggested that the positive self-concept 
may also indicate a highly defensive paranoid individual. Frenkel-Brunswik and 
Hart offer further support of the theory that positive self-concept, for some 
individuals, may be compensatory in nature. Hart recognized the difficulty in 
differentiating between true self-esteem and inflated self-esteem. The possibility that high 
tiating between true self-esteem and inflated self-esteem. The possibility that high 
self-esteem can compensate for underlying negative attitudes toward self has im- 
portant theoretical implications. 

The purposes of this study are to examine the heterogeneity of the positive 
self-concept group, and to determine whether homogeneous sub-groups are related 
to external criteria. A positive self-concept is defined as a small discrepancy between 
self and ideal-self ratings. It was hypothesized that quantitative differences exist 
within a group of individuals expressing positive self-concept. 


METHOD 


Self-Concept. The experimenter constructed a Self Description Inventory (SDI), a measure of 
self-concept which is expressed as a discrepancy (D), and which provides in addition, self (S) 
and ideal-self (I) scores for each subject providing a basis of identifying quantitative differences 
within the positive self-concept group. The portion of the SDI used in this study consists of 60 
items rated on a five point scale (a) as the individual describes himself and (b) as the individual 
would ideally like to be. The discrepancy between the two responses constitutes the score for each 
item; item discrepancies are totaled to yield a measure of self-concept (D). Reliabilities of the 
three SDI scales are shown in Table 1. Detailed information about item selection, derivation of 


TABLE 1. RELIABILITIES OF SDI SCALES 


SDI Scale N Split-half N Test-retest* 
Discrepancy 110 .93 li .68 
Self 100 .90 213 .83 
Ideal-self 100 .90 21 .39 








*Two months elapsed time. 


self and ideal-self scores, and reliabilities of parts of the scales can be found in the author’s doctoral 
dissertation. 
Positive Self-Concept. The distribution of SDI discrepancy scores was divided into quarters. 
The highest quarter (D4) was defined as the negative self-concept group. The lowest quarter 
(D1) was designated the positive self-concept group and was treated as a discrete sample. Self 
scores of D1 (positive self-concept) individuals were ranked and the distribution was divided into 
thirds. The lowest third of the self scores was designated s1, the middle third was designated s2, and 
the highest third was designated s3. Individuals with positive self-concepts were also ranked 
according to ideal-self scores and discrepancy scores. These rankings were divided into thirds and 
designated similarly: i1, i2, i3, and d1, d2, and d3. Every individual with a positive self-concept 
was in one of the three categories on each of the scales. 
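
The scoring and grouping steps described above can be sketched as follows. This is an illustration under stated assumptions, not the author's procedure: item discrepancies are taken as absolute differences, ties in rank are broken by list order, and the names `discrepancy_score` and `rank_groups` as well as all numbers are hypothetical.

```python
def discrepancy_score(self_ratings, ideal_ratings):
    """Total of item-level |self - ideal| differences (the D score).

    Using the absolute difference per item is an assumption.
    """
    if len(self_ratings) != len(ideal_ratings):
        raise ValueError("self and ideal ratings must cover the same items")
    return sum(abs(s - i) for s, i in zip(self_ratings, ideal_ratings))


def rank_groups(scores, k):
    """Split scores into k rank-ordered groups; 1 = lowest, k = highest."""
    n = len(scores)
    order = sorted(range(n), key=lambda idx: scores[idx])
    groups = [0] * n
    for rank, idx in enumerate(order):
        groups[idx] = rank * k // n + 1
    return groups


# Hypothetical D scores for eight subjects (not data from the study).
d_scores = [10, 40, 20, 30, 50, 60, 70, 80]
quarters = rank_groups(d_scores, 4)

# Subjects in the lowest quarter form the positive self-concept group.
positive = [idx for idx, q in enumerate(quarters) if q == 1]
print(quarters)
print(positive)
```

Calling `rank_groups` with `k=3` on the self, ideal-self, or discrepancy scores of the positive group alone would give the thirds described in the text.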
*This article is based on a doctoral dissertation submitted to Teachers College, Columbia Uni- 
versity. Members of the dissertation committee were Professors Laurence F. Shaffer, Chairman; 
Arthur T. Jersild; and Paul E. Eiserer. 
Attitudes Toward Others. Theoretical concepts as well as experimental findings 
support the thesis that positive attitude toward self is correlated with positive attitude toward 
others. Attitude toward others was therefore selected as an external criterion for differentiating 
among subgroups of individuals with positive self-concepts. The Chicago Self-Others Scale, which 
measures attitude toward a “generalized” other, was used in this study. 

Predicted Academic Achievement. A possible criterion for differentiating individuals with positive 
self-concepts was the accuracy with which they predict future achievement. Subjects predicted 
their academic achievement on a seven point scale. In order to obtain a difference score between 
predicted and actual achievement, grade point averages of subjects were given a value according 
to their position on a seven point scale. 

Academic Achievement. Academic achievement was selected as a criterion which might differen- 
tiate subgroups of individuals with positive self-concepts because of theoretical indications that 
highly defensive individuals may be characterized by low achievement. 


Subjects and Administration of Tests. The SDI was administered as part of the 
Orientation Week testing program to the 850 freshmen entering a state university in 
1956. Before students had entered their first examination period, they were asked to 
predict their relative academic standing for the first semester (N = 761). The 
Chicago S-O Scale was administered during the second semester of the subjects’ 
freshman year (N = 563). Grade point averages were available in the Recorder’s 
Office for 791 of the students who completed the fall semester. 


RESULTS 
Interrelationships of SDI Scales. Intercorrelations between the three scales were: 
Discrepancy-Self r = -.57; Discrepancy - Ideal-self r = .30; and Self - Ideal-self 
r = .37. 

Women’s D scores were slightly higher than men’s (significant at the .05 level 
of confidence). The ideal-self scores of the women were also higher than those of the 
men (significant at the .01 level of confidence). The higher discrepancy scores for 
women were accounted for by their higher ideal-self scores. See Table 2. 





TABLE 2. SEX DIFFERENCES ON SDI SCALES 


Measure        Men (N = 556)      Women (N = 294)      t 
               M        SD        M        SD 

Discrepancy    7.96     22.6      51.6     21.7 
Self           217.6    2%        218.9    23 
Ideal-self     247.6    23.6      254.9    22.% 





*Significant at the .05 level. 
**Significant at the .01 level. 


The self scores of individuals with positive self-concept ranged from 173 to 271 
with an M of 231 and SD of 20. The ideal-self scores of the same subjects ranged 
from 169-295 with an M of 239 and SD of 24. This group was characterized both by 
low self and low ideal-self ratings, as well as by high self and high ideal-self ratings. 

Although the three SDI scales were significantly related, the coefficients of de- 
termination suggest that the scales measure separate but related aspects of an in- 
dividual’s self-concept. In addition, the heterogeneity of the positive self-concept 
group can best be demonstrated by the variety of self - ideal-self combinations which 
resulted in a small discrepancy score. 
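
The coefficients of determination mentioned above are simply the squared intercorrelations. A small worked sketch follows, assuming the third coefficient reads .37 (the printed value is partly illegible); the dictionary keys and the name `shared_variance` are illustrative.

```python
# Intercorrelations as reported in the text; the .37 value is an assumed reading.
correlations = {
    "Discrepancy-Self": -0.57,
    "Discrepancy-Ideal-self": 0.30,
    "Self-Ideal-self": 0.37,
}


def shared_variance(r):
    """Coefficient of determination: proportion of variance two scales share."""
    return r * r


for pair, r in correlations.items():
    print(f"{pair}: r = {r:+.2f}, r^2 = {shared_variance(r):.2f}")
```

On these figures no pair of scales shares more than about a third of its variance, which is consistent with the statement that the scales measure separate but related aspects.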

Attitude Toward Others. Because of the lapse of time between the administrations of 
the SDI and the S-O Scale, the SDI was readministered to a random sample of 98 
students. No significant differences were found when the S-O Scale was related to 
the two administrations of the SDI. Therefore, all subjects who had taken both the 
SDI and the S-O Scale were used in the study. The discrepancy scale of the SDI and 
the S-O Scale correlated -.03. Attitude toward others was not related to 
self-concept. Analysis of S-O scores within D quarters showed in both D1 (positive 
self-concept) and D4 (negative self-concept) a significantly greater number of men scoring 
in the highest S-O scale quarter (negative attitude toward others) than would be 
expected by chance: χ² = 12.5 and 5.6 respectively, with both significant beyond 
the .05 level. Both positive and negative self-concept groups of men were 
characterized by negative attitudes toward others. 


TABLE 3. MEANS AND STANDARD DEVIATIONS OF S-O SCORES WITHIN D1 
SUBGROUPS (N = 151) 


D! Subgroups N M SD 

392.11 42 

413.66 49 

416.87 47 

390.37 41 

421.59 44 

414.98 

402.97 

417.90 


107 . 96 


Table 3 shows that s1 and s2 groups scored significantly higher on the S-O Scale 
than s3 (t = 2.75 and 2.25 respectively). Similarly, i1 and i2 groups scored significantly 
higher on the S-O Scale than did i3 (t = 2.77 and 3.37 respectively). When the d 
groups were analyzed on the basis of S-O scores, the t values were not significant. 

A positive self-concept, as measured and defined in this study, was not always 
correlated with a positive attitude toward others. Low discrepancy scores character- 
ized by low self scores and low ideal-self scores were also characterized by negative 
attitude toward others. 


Predicting Academic Achievement. Results disclosed no significant relationship be- 
tween self-concept and prediction of grades. Prediction of grades did not differentiate 
subgroups of individuals with positive self-concepts. The D1 group differed from the 
other D groups in the direction of overestimating grades. Only in the comparison of 
the D1 and Dy groups was the difference significant (t = 2.65, significant at the .01 
level). Results suggest only a tendency toward overestimation of academic achieve- 
ment by positive self-concept individuals. 

Actual Academic Achievement. There was no significant relationship between self- 
concept scores and grades. Academic achievement differentiated only women among 
defined subgroups of individuals with positive self-concepts. The s; group of women 
had significantly lower grades than the s; and s; women (t = 3.43 and 3.13 
respectively). In addition, the grade point average of women in the s; group was 
significantly lower than the mean grade point average of freshman women (t = 3.41). 

ACE scores of women with positive self-concepts were examined to determine 
whether differences in academic aptitude accounted for the low academic 
achievement of s; women. Q scores of the three s groups of D1 women did not differ 
significantly. However, the L score was significantly lower for the s; group. It is 
possible that the self-concepts of these individuals have operated to inhibit the type of 
learning predicted by the Linguistic Scale of the ACE. 


SUMMARY 


1. The self-concepts of 850 college students were measured and related to 
attitude toward others, prediction of academic achievement, and actual academic 
performance. 

2. Results disclosed that the positive self-concept group contained individuals 
with low self and low ideal-self scores as well as individuals with high self and high 
ideal-self scores. 
3. Positive self-concept subgroups were differentiated by attitude toward 
others. Men and women responded differently to the self-concept scale and to the 
external criteria. 
DIFFERENCE BETWEEN NEUROTICS, PSYCHOTICS AND 
NORMALS ON PERCEPTUAL TESTS: REANALYSIS OF 
EYSENCK’S RESULTS* 


STEVEN G. VANDENBERG 


University of Michigan 


PROBLEM 

Recently Eysenck reported the results of a comparison of the performances 
of 20 neurotics, 20 psychotics, and 106 normal males on a variety of objective tests 
yielding a total of 91 scores. Included in this report are tables showing means and 
variances for the three groups plus one F ratio of the between-groups and the within- 
groups variances. This statistical technique capitalizes on three comparisons: 
Neurotics vs. psychotics, neurotics vs. normals, and psychotics vs. normals. It seems 
of interest to know the results of each of these three comparisons separately. Eysenck 
states that neuroses are not merely milder forms of mental illness than the psychoses. 
He contends that neuroticism and psychoticism form two orthogonal dimensions of 
personality. If this view is correct, we would expect that the means of the three 
groups would not always be in the order: normal, neurotic, psychotic. 


METHOD 
The t-test was used to determine the statistical significance of the difference 
between the means of the neurotics (Neur.) and the psychotics (P) on each of the 
tests, as well as between the means of the neurotics and of the normals (Nor.), and 
between the means of the psychotics and of the normals. 

*The analysis reported here was supported by the Schizophrenia and Psychopharmacology Study, 
Research Project of the University of Michigan and Ypsilanti State Hospital, Ralph W. 
d, Principal Investigator. 
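
Each pairwise comparison described in the Method section can be computed from the published means, variances, and group sizes alone. The sketch below assumes a pooled-variance (Student) t-test, which the text does not specify; the function name `pooled_t` and the example means and variances are hypothetical, while the group sizes of 20 and 106 are those given in the text.

```python
import math


def pooled_t(mean_a, var_a, n_a, mean_b, var_b, n_b):
    """Two-sample t statistic from summary statistics, pooling the variances.

    Returns (t, degrees_of_freedom). Pooling is an assumption; the source
    does not say which form of the t-test was used.
    """
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    std_error = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    return (mean_a - mean_b) / std_error, df


# Hypothetical means and variances for one test (not Eysenck's data).
t_value, df = pooled_t(12.0, 4.0, 20, 10.0, 4.0, 20)   # neurotics vs. psychotics
print(f"t = {t_value:.2f} on {df} degrees of freedom")

t_norm, df_norm = pooled_t(12.0, 4.0, 20, 10.0, 4.0, 106)  # neurotics vs. normals
print(f"t = {t_norm:.2f} on {df_norm} degrees of freedom")
```

Running the function three times per test (neurotics vs. psychotics, neurotics vs. normals, psychotics vs. normals) yields the separate comparisons that a single overall F ratio does not distinguish.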


RESULTS 


TABLE 1. THE MEAN OF THE NEUROTICS LIES BETWEEN THE MEAN OF THE NORMALS AND THE MEAN 
OF THE PSYCHOTICS. A REVERSED ORDER IS INDICATED BY R BEHIND THE NAME OF THE TEST. 


Measure 


a 
P-Neur 
Dark vision ( 5 mins) 1.94 3.4: 1 
(10 mins) 20 
(15 mins) 99 
(20 mins) 
(30 mins) 
Total fusional reserves (Near Vision) R 
Total fusional reserves (Distant Vision) R 
Accommodation (Binocular: blur) 
(Binocular: recovery) 
(Right eye: blur) 
(Right eye: recovery) 
(Left eye: blur) 
(Left eye: recovery) 
Livingston Binocular gauge (Subjective convergence) 
Bishop Harman Diaphragm test R 
Autokinetic reaction time 
Retinal rivalry R 
Worth 4-dot reversals R 
Street Gestalt completion test (correct responses) R 
Street Gestalt completion test (mean response time) 
PMA space R 
Peripheral span R 
Circles test (average response time) 
Circles test (retardation in judgment time) R 
Kundt illusion (slow performance) R 
Angular illusion R 
Aubert phenomenon (total difference) 
After-Image (duration of first image) R 
After-Image (average duration of two tests) R 
After-Image (1 - 2) R 
Prism glasses (adaptation score) R 
Lines assessment (orderly position) 
Lines assessment (random position) 
Learning dots (trials) 
Learning dots (wrong groups) 
Square pattern (error level) R 
Expressive movement (surface first drawing) 
Expressive movement (average sidelength) 
Visual object recognition (head content) 
Visual object recognition (spectacles content) 
Visual object recognition (head third dimension) 
Visual object recognition (spectacles third dimension) 
Visual object recognition (head-spectacles content) 
Visual object recognition (head-spectacles third 
dimension: combined) 
Lie scale R 
Interests R 
Dislikes R 
Sentence completion R 
Mean log time on Nufferno test 
Skew score 





*Significant at the 5% level. 
**Significant at the 1% level. 


Table 1 gives the results for those 50 tests on which the neurotics were 
intermediate between normals and psychotics. Table 2 gives the t-tests for 13 tests on 
TABLE 2. TESTS WHERE THE MEAN OF THE NORMALS LIES BETWEEN THE MEAN OF THE NEUROTICS 
AND THE MEAN OF THE PSYCHOTICS. A REVERSED ORDER IS INDICATED BY R BEHIND THE NAME OF 
THE TEST. 


Measure 
Nufferno Intelligen 
Giles Rule (Objective Convergence) 
Critical Flicker Frequency 
Gottschaldt figures (A) R 
Gottschaldt figures (B) R 
Block design R 
Aubert phenomenon R 
After-image (first distortion score) R 
Prism glasses R 
Square pattern (trials) R 
W ra connectl 
Nufferno level 
Error score 

*Significant at the 5% level. 
**Significant at the 1% level. 


which the normals were placed between the neurotics and the psychotics, while 
Table 3 lists the t-tests for 28 tests on which the mean for psychotics is between the 


TABLE 3. THE MEAN OF THE PSYCHOTICS LIES BETWEEN THE MEAN OF THE NEUROTICS AND THE 
MEAN OF THE NORMALS. R INDICATES REVERSED ORDER. 
Measure 


Age 
Dark vision (25 mins) 
Visual acuity 


Color Form attitudes (Ran! 
kK Test 
Necker cube 
Dotted outlines R 
Muller-Lyer illusion 
Sander illusion 3.2 1.34 
Poggendorff illusion R ; 56 
Kundt illusion (fast performance) 62 
Aubert phenomenon (T ibsolute vol 55 57 
After-image (fluctuation) : ; **3 .00 
After-image (second distortion score) 3 ) 30 
Emmert’s Law (first screen stimulus) E , 08 
Emmert’s Law (third screen stimulus) 7 1] 
Prism glasses ; 5 
Eye-hand co-ordination 
Eye-hand co-ordination 
Eye-hand co-ordination 
Expressive movement 
Expressive movement 
Expressive movement 
Maudsley questionnaire 
Food dislikes R 
Annoyances 
Worries 

*Significant at 5% level. 
**Significant at 1% level. 



mean for the normals and the mean for the neurotics. Table 4 summarizes the re- 
sults. Column A shows the number of tests which discriminated at the 5 per cent 
level of significance and column B shows the number of tests which discriminated at 
the 1 per cent level of significance. 

TABLE 4. SUMMARY OF THE TESTS DISCRIMINATING BETWEEN NORMAL, NEUROTIC AND PSYCHOTIC 
GROUPS AT 2 LEVELS OF SIGNIFICANCE 


pared Groups With 


Intermediate Number of Tests Signifi Number 


) <. .05 < n. 8 


Mean 


f 


Neurotics 9 20 16 
Normals . 11 


Psychotics 


Neurotics 
Normals 


Psychotics 


Neurotics 
Normals 
Psychotics 


The finding that on 41 out of 81 tests the mean for the neurotics did not occupy 
a position intermediate between the normals and psychotics gives experimental 
support to Eysenck’s theory that neuroticism and psychoticism form two independent 
continua. Unfortunately, the data indicate that out of a list of some 91 test 
scores only 7 discriminated at the 5 per cent level of significance between psychotics 
and neurotics, and of these 7 tests only one test (CFF) discriminated all 3 types of 
subjects from one another. 
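
The kind of reanalysis described here can be sketched in a few lines. The example below is only an illustration under stated assumptions: it uses a pooled-variance Student's t computed from published summary statistics (the text does not say which form of the t-test was used), and the means, standard deviations, and group sizes are hypothetical, not values from the tables.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's t for two independent groups, from summary statistics.

    Assumes equal population variances (pooled estimate). Returns (t, df).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error, df


def intermediate_group(means):
    """Name of the group whose mean lies between the other two."""
    return sorted(means, key=means.get)[1]


# Hypothetical summary statistics for one test (not taken from the paper).
t_value, df = pooled_t(10.0, 2.0, 20, 12.0, 2.0, 20)
middle = intermediate_group({"normals": 10.0, "neurotics": 12.0, "psychotics": 11.0})
print(round(t_value, 3), df, middle)
```

Sorting each test by which group holds the intermediate mean is what separates the three tables; the t values then indicate which pairs of groups a test actually discriminates.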


SUMMARY 

Data presented by Eysenck et al. have been reanalyzed to obtain t-tests of the 
statistical significance of the differences between the three groups of subjects. When 
a 5 per cent level of significance is used, we find that 44 tests discriminated between 
psychotics and normals and 34 tests discriminated between neurotics and normals, 
but only 7 tests discriminated between psychotics and neurotics. CFF was the only 
test which discriminated all three types of subjects. The results tend to support 
Eysenck’s bi-factor theory of neurosis vs. psychosis, but show that many of the tests 
lack discriminatory power. 

HOMOSEXUALITY AND CREATIVITY 
ALBERT ELLIS 
333 West 56th Street 


New York, N. Y. 


PROBLEM 

Although there is much speculation about homosexuals being more artistic or 
creative than heterosexuals, a search of the literature discloses no objective studies 
in this connection. Indeed, studies of homosexuals which employ an adequate con- 
trol group of heterosexuals, and which compare any of their characteristics or aspects 
of their behavior with those of a control group, are very rare. This study compares 
a group of homosexual patients in psychotherapy with an equated group of highly 
heterosexual psychotherapy patients to determine group differences in clinical 
diagnosis, creativity, and clinical improvement during therapy. 

METHOD 

Sixty-six individuals with severe homosexual problems and 150 exceptionally 
heterosexual psychotherapy patients, who were seen by the writer for a minimum of 
ten and a maximum of 350 sessions, were equated for age, sex, and educational 
status. The homosexual patients were further divided into three subgroups: (a) 
9 male and 10 female bisexuals (that is, individuals with a strong homosexual com- 
ponent, including a history of overt homosexuality, but who also had a reasonably 
strong heterosexual component); (b) 23 male and 10 female fixed homosexuals (that 
is, individuals who were exclusively or predominantly homosexual but who main- 
tained their own sex role); (c) 10 male and 4 female inverts (that is, individuals who 
were exclusively homosexual and who maintained what Brown has called sex role 
inversion, with the males adopting a female or “fairy” role and the females playing 
a masculine or “butch” role in their everyday living). 

Each of the patients was rated by the therapist, on the basis of his intimate 
psychotherapeutic experience with the subject, as to whether he or she was (a) 
neurotic, borderline psychotic, or psychotic; (b) highly creative, moderately 
creative, or non-creative; (c) distinctly more creative or little more creative following 
therapy; (d) considerably, distinctly, or little improved clinically during therapy; 
and (e) considerably, distinctly, or little changed in the direction of heterosexual 
behavior during therapy. In the ratings for creativity, the criteria hypothesized 
by Guilford (“sensitivity to problems, ideational fluency, flexibility of set, 
ideational novelty, synthesizing ability, analyzing ability”) and English and English 
(“to find new solutions to a problem or new modes of artistic expression”) were 
employed. 


RESULTS 
TABLE 1. ... BETWEEN EXCEPTIONALLY HETEROSEXUAL ... 

Exceptionally Heterosexual Patients    Bisexuals    Fixed Homosexuals    Inverts 

CREATIVITY 
Highly creative 
Moderately creative 
Non-creative 
INCREASE IN CREATIVITY FOLLOWING THERAPY 
Distinct increase 
... increase 
CLINICAL IMPROVEMENT DURING THERAPY 
Considerable 
Distinct 
Little or none 
CLINICAL IMPROVEMENT IN RELATION TO HOMOSEXUAL PROBLEM DURING THERAPY 
Considerable 
Distinct 
Little or none 

The results of the therapist’s ratings on the 66 homosexual and 150 highly 
heterosexual patients used in this study are shown in Table 1. When some of the 
observed findings in this table are subjected to Chi-square analysis the following 
significant and non-significant differences are obtained: 

1. The exceptionally heterosexual subjects are significantly more often 
diagnosed as neurotic and less often diagnosed as borderline psychotic and 
psychotic than the exceptionally homosexual (that is, fixed homosexual and 
inverted) subjects (p = .01). 

2. The exceptionally heterosexual subjects are significantly more often 
rated as highly or moderately creative and less often rated as non-creative than 
the exceptionally homosexual subjects (p = .01). 

3. With clinical diagnosis held constant, the neurotic heterosexual subjects 
are more often rated as highly or moderately creative and less often rated 
as non-creative than the fixed homosexual and inverted subjects; but this 
difference is not statistically significant. 

4. The extremely heterosexual subjects are more often rated as showing a 
distinct increase in creativity following psychotherapy than the extremely 
homosexual subjects; but this difference is not statistically significant. 

5. The exceptionally heterosexual subjects are significantly more often 
rated as showing considerable or distinct clinical improvement and less often 
rated as showing little or no clinical improvement during therapy than the 
exceptionally homosexual subjects (p = .01). 

6. With clinical diagnosis held constant, the neurotic heterosexual subjects 
are significantly more often rated as showing considerable or distinct clinical 
improvement than the neurotic fixed homosexuals and inverts (p = .01). 

7. Compared to the fixed homosexual and inverted subjects, the bisexual 
subjects are more frequently diagnosed neurotic and less frequently borderline 
psychotic and psychotic (non-significant); significantly more often rated highly 
or moderately creative and less often rated non-creative (p = .06); more often 
rated as having increased their creativity following therapy (non-significant); 
significantly more often rated as showing considerable or distinct clinical 
improvement during therapy (p = .05); and significantly more often rated as 
showing considerable or distinct gains in heterosexuality during therapy 
(p = .01). 

8. Compared to the homosexual inverts, the fixed homosexuals without sex 
role inversion are more often diagnosed as being neurotic and less often as being 
borderline psychotic or psychotic (non-significant); significantly more often 
rated as being highly or moderately creative (p = .05); more often rated as 
having increased their creativity following therapy (non-significant); 
significantly more often rated as showing considerable or distinct clinical 
improvement during therapy (p = .05); and more often rated as showing considerable 
or distinct gains in heterosexuality during therapy (non-significant). 
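
Comparisons of this kind reduce to a Chi-square test on a contingency table of rating frequencies. The following is a minimal sketch for the 2 x 2 case, assuming no continuity correction (the text does not say whether one was applied); the cell counts are hypothetical, since the frequencies of Table 1 are not legible here.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square (1 df, no continuity correction) for the table
        [[a, b],
         [c, d]]
    Returns (chi2, p), with p the upper-tail probability.
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # With 1 df, chi2 is a squared standard normal deviate,
    # so the tail area is erfc(sqrt(chi2 / 2)).
    p = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p


# Hypothetical counts: rows are two patient groups,
# columns are "creative" versus "non-creative" ratings.
chi2, p = chi_square_2x2(30, 10, 15, 25)
print(round(chi2, 2), p < 0.01)
```

Collapsing three rating levels into two (for example, highly or moderately creative versus non-creative) is what turns the findings above into 2 x 2 comparisons of this form.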


DISCUSSION 

The obtained data very consistently, though at times non-significantly, uphold 
the hypothesis that degree of homosexuality is directly related to both degree of 
emotional disturbance and loss of creative potential. This hypothesis would also 
tend to be supported by the findings of Drevdahl, who found that although 
creative individuals tend to be more non-conforming than non-creative ones, they 
also tend to be more self-sufficient and flexible. In a follow-up study, Drevdahl and 
Cattell found that creative artists and writers differed from the normal population 
in being more emotionally mature, dominant, adventurous, and less cyclothymic, 
surgent, and subject to group standards and control. The findings of the present and 
these confirmatory studies would tend to question seriously the long held stereotype 
of the creative individual as neurotic or psychotic. 

It may well be, as Drevdahl’s original study indicates, that if we limit the term 
“creative” to individuals working in artistic or esthetic fields, such individuals are 
somewhat more suspicious and disturbed than are other creative individuals (such 
as scientists) or non-creative persons. But if we include under the rubric of creativity 
all kinds of inventive, original individuals, the traditionally held stereotype quickly 
breaks down. 

In the case of homosexuals in particular, it may well be that the illusion that 
is commonly held of their being unusually creative individuals results from such 
factors as (a) their frequently gravitating to esthetic rather than scientific or 
industrial fields; (b) the ease of one’s gaining popular fame or notoriety, in our culture, 
when one is prominent in esthetic or artistic rather than other creative areas; (c) the 
common confusion which exists between performance (as in the theatre, the opera, or 
the dance) and original creation (e.g., a fine play, invention, or industrial technique); 
and (d) the fact that homosexual creators become unusually well-known because of 
their aberrated lives, while many heterosexual originators live relatively unobtrusive 
and relatively unknown lives. 


SUMMARY 


Sixty-six individuals with severe homosexual problems and 150 exceptionally 
heterosexual psychotherapy patients were rated by the therapist as to clinical 
diagnosis, creativity, increase in creativity following therapy, clinical improvement, and 
(in the case of the homosexuals) increase in heterosexuality. The main findings were: 

1. Compared to the homosexual patients, the exceptionally heterosexual subjects 
were significantly less emotionally disturbed, more creative, and more clinically 
improved during therapy. 

2. Compared to the fixed homosexual and inverted patients, the bisexual subjects 
were significantly more creative, more clinically improved during therapy, and 
made greater gains in heterosexuality. 

3. Compared to the homosexual inverts studied, fixed homosexuals without sex 
role inversion were significantly more creative and more clinically improved during 
therapy. 

4. It appeared likely that a good part, though probably not all, of the obtained 
relationships between degree of homosexuality, on the one hand, and lesser creativity 
and clinical improvement with therapy, on the other hand, was attributable to the 
greater severity of emotional disturbance found among the homosexual patients. 


REFERENCES 


yblems 
ENGLISH 
analytical tern 


6. GUILFORD, 

PROBLEM 


Disposition of a patient entering a psychiatric out-patient clinic depends to a 
large extent upon the evaluation made of the patient’s complaints during the initial 
screening procedure. One important consideration in such disposition will undoubt- 
edly be the nature and relative severity of the presenting complaints. The purpose of 
the present study is to examine whether there are stimulus dimensions among the 
multitude of possible psychiatric complaints which lead to differential judgment of 
the severity of such complaints on the part of the intake interviewer. An attempt 
will also be made to see whether a classification of psychiatric complaints can be 
made in terms of a scheme that is meaningful to professional raters from three mental 
health professions and to test the hypothesis that judgmental differences may be a 
function of difference in professional affiliation. While it is known that there are 
differences of opinion as to the characteristics of personality disturbance among 
members of these professional groups, it remains to be seen whether such differences 
also will be reflected in the process of assessing the severity of patient complaints. 
To overcome the difficulty of adequately sampling widely diverse professional 
groups, use is made of the replication of small samples and of obverse factor 
analysis. 


PROCEDURE 


In the course of another study approximately 200 successive case files from a 
psychiatric out-patient clinic were examined and all presenting complaints noted. 
These complaints were then sorted into different categories until a meaningful 
system of classification emerged. In contrast to studies such as that by Lorr and 
Rubinstein, or Tatom, the experimenters were not concerned with classification 
into a given nosological system of psychiatric diagnosis, but rather with the meaningful 
assignment of verbalized complaints to a system on which agreement by professional 
raters would be possible. Two dimensions were found to be required to place 
each complaint. These dimensions were named the “determinants” and “referents” 
of a given complaint. The determinant identifies the specific nature of the complaint; 
the referent gives the attributed cause or result to which the nature of the complaint 
refers. Seven determinants and five referents were found relevant for classification 
and were named as follows: 


DETERMINANTS               REFERENTS 

1. Affective               a. Physical health 
2. Anxietal                b. Mental health 
3. Behavioral              c. Economic 
4. Mentational             d. Inter-personal 
5. Reality distortive      e. Non-specific 
6. Welfare 
7. Somatic 
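
The crossing of the seven determinants with the five referents can be enumerated directly. The sketch below is illustrative only: the category labels (a determinant number followed by a referent letter) follow the order in which the lists are given above, and the variable names are hypothetical.

```python
from itertools import product

DETERMINANTS = [
    "Affective", "Anxietal", "Behavioral", "Mentational",
    "Reality distortive", "Welfare", "Somatic",
]
REFERENTS = [
    "Physical health", "Mental health", "Economic",
    "Inter-personal", "Non-specific",
]

# One classification unit per determinant-referent combination,
# labelled by position, e.g. "1a" = Affective x Physical health.
categories = [
    (f"{d_index}{'abcde'[r_index]}", determinant, referent)
    for (d_index, determinant), (r_index, referent) in product(
        enumerate(DETERMINANTS, start=1), enumerate(REFERENTS)
    )
]

STATEMENTS_PER_CATEGORY = 2  # a balanced sample draws the same number from each unit
total_statements = len(categories) * STATEMENTS_PER_CATEGORY
print(len(categories), total_statements, categories[0])
```

Drawing the same number of statements from every unit is what makes the sample balanced: each determinant is then represented by 10 statements and each referent by 14.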


The combinations of determinants and referents provide 35 possible classification 
units. An example for a complaint which fits category 1a would be: “I feel blue 
because my health is bad.” The determinant of this complaint is of an “affective” 
nature (i.e. I feel blue) and names as its cause problems referring to “physical 
health”.² All complaints were then sorted into these categories and a balanced Q-sort 
sample was composed by selecting two representative complaints from each category. 
This gave a total of seventy complaints. Professional raters were asked to distribute 
these seventy complaints in a quasi-normal Q-sort distribution containing 11 intervals. 
This method is a convenient procedure for ranking a relatively large number of items 
along a continuum using manageable numbers of discriminative choices. 

Raters were instructed to distribute the complaint items in their order of ... 
They were asked to place the two items which they considered to be the 
most disabling complaints into the category identified by a score of ten and those two 
items which were least disabling ... which yielded a score of 0. This 
... according to the required ... 

¹This paper is one of a series reporting a larger study of mental health problems of older persons 

TABLE 1. INTERCORRELATIONS BETWEEN RATERS (N = 70 COMPLAINT STATEMENTS)* 

*Correlations are significant at the 1% level of confidence if they exceed .30, and at the 5% 
level if they exceed .23. 
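
The two cut-off values in the table footnote can be checked approximately. The sketch below assumes a two-tailed test and uses Fisher's z approximation for the sampling distribution of a correlation; this is an illustrative choice, not necessarily the procedure the authors followed.

```python
import math
from statistics import NormalDist


def critical_r(n, alpha):
    """Approximate two-tailed critical correlation for n paired observations.

    Uses Fisher's z transformation: atanh(r) is roughly normal with
    standard error 1 / sqrt(n - 3) when the true correlation is zero.
    """
    z = NormalDist().inv_cdf(1 - alpha / 2)
    return math.tanh(z / math.sqrt(n - 3))


N_STATEMENTS = 70  # each correlation between two raters is taken over 70 statements
r_05 = critical_r(N_STATEMENTS, 0.05)
r_01 = critical_r(N_STATEMENTS, 0.01)
print(round(r_05, 3), round(r_01, 3))
```

Under these assumptions the thresholds come out close to the .23 and .30 quoted in the footnote.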


variance which is specifically attributable to professional identification, regardless of 
any other factors operating in the judgment of the seventy psychiatric complaints, 
then by maximizing the variance common to the members of each professional 
triplet and by minimizing that portion of the variance common to both the members 
of the triplet and all the other members of the set of raters, one should be able to 
obtain a solution where in a three factor system (one for each profession) simple 
structure can be shown to exist. That is, if, and only if, there is a unique factor 
contributing to the manner in which members of the different professions rank 
psychiatric complaints, then the factorial solution will contain one factor for each group 
on which the members of the professional triplet have positive and significant load- 
ings, while all other raters have zero or chance loading. 

Table 2 presents the factor loading matrices before and after oblique rotation. 
If one specifies that a factor loading below .20 is to be considered a chance deviation 
from zero, then examination of this table leads to the conclusion that the hypothesis 
as to the presence of an unique judgmental factor related to the professional group- 
ings is not adequately supported. There is overlap on each of the three factors in at 
least one of the two sets and it is clear that this solution is not the most parsimonious 
one. On both sets, however, there seems to be some unique factor saturation for 
some of the psychiatrists and the social workers, while the hypothesized “psychologist 
factor” would seem to represent a dimension of inter-professional communality. 

TABLE 2. FACTOR-LOADINGS FOR THE HYPOTHESIS-TESTING FACTOR ANALYSIS* 


Multiple Group Factor Loadings Factor Loadings After Oblique 
Rotation 

*Factor loadings greater than .20 are considered to be significant. 

Analysis of variance and examination of mean scores. The analysis of variance was 
next used to test hypotheses with respect to the meaningfulness of our classification 
system and to assess further the effect (if any) of membership in a specific profession 
upon the rankings made. Since the scores on the different categories were all obtained 
by Q-sort procedure, the variances for individual raters and professional 
groupings are, of course, required to be equal by definition. An overall analysis of 
variance test is possible, however, although group and individual differences must 
be tested by examining the simple interactions. 

The applicable null hypotheses for this analysis may be stated as follows: (a) 
there are no differences between the various complaint determinants; (b) there are no 
differences between the various complaint referents; (c) there is no significant inter- 
action between complaint determinants and referents; (d) there are greater differ- 
ences among raters within each professional group than among the groups on the 
complaint determinants; (e) there are greater differences among raters within each 
professional group than among groups on the complaint referents; (f) the interaction 
between determinants and referents within each professional group is larger than the 
interaction between the complaint dimensions among the groups. 

Table 3 shows the summary for the analysis of variance and indicates that 
hypotheses a, b and c must be rejected. Significant overall differences among the 
complaint determinants, referents and their interaction suggest that the classification 
system is reasonable and useful in distinguishing between different types of complaints. 
Hypotheses d, e and f are accepted and suggest the conclusion that there are 
no significant differences among the professional groupings in assessing the severity 
of either specific complaints, complaint determinants or complaint referents. The 
results for this analysis are shown to be essentially replicated for the second set of 
raters. In both instances, the hypotheses with respect to the meaningfulness of the 
classification system are confirmed and the hypotheses with respect to interpro- 
fessional differences rejected. 


What then are the dimensions underlying the judgment of a complaint’s severity? 
A clue may be found in the mean scores for each of the a priori sorting classifications. 
Tables 4 and 5 give the mean scores for each of the complaint determinants 
and complaint referents. Since the initial analysis suggests that the two sets of 
raters are quite similar, their ratings were now pooled and mean scores are given for 
the six members of each professional grouping as well as for all eighteen raters. Using 
the within cells variance over all the raters (from the analysis of variance) the standard 
error of the difference for scores on the complaint determinants was found to be 
.30 and for the complaint referents to be .26. Using the 3 S.D. criterion, significant 
differences must attain a magnitude of .90 score points for determinants and .78 
score points for the referents. Only one determinant shows a significant difference, 
the psychologists rating “mentational” complaints more severe than did the social 
workers. None of the referents showed significant differences. 

MEAN SCORES OF RATERS ON THE COMPLAINT DETERMINANTS 
(10 STATEMENTS FOR EACH OF 6 RATERS IN EACH GROUP) 

MEAN SCORES OF THE COMPLAINT REFERENTS 
(... RATERS IN EACH GROUP) 
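
The 3 S.D. criterion is simple arithmetic: a difference between two mean scores counts as significant only if it is at least three times the standard error of the difference. The short sketch below reproduces the thresholds from the standard errors reported in this section (.30 and .26 for comparisons between professions, .18 and .15 for the comparison among categories over all raters); the function name is hypothetical.

```python
from decimal import Decimal


def three_sd_threshold(standard_error):
    """Smallest mean difference treated as significant under a 3 S.D. rule."""
    return 3 * Decimal(standard_error)


reported_standard_errors = {
    "determinants, between professions": "0.30",
    "referents, between professions": "0.26",
    "determinants, all raters": "0.18",
    "referents, all raters": "0.15",
}

for comparison, se in reported_standard_errors.items():
    print(f"{comparison}: SE = {se}, threshold = {three_sd_threshold(se)}")
```

Decimal arithmetic is used only so that the printed thresholds match the two-place values quoted in the text exactly.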


When one examines the mean differences among categories for all the raters, 
however, significant findings occur. The standard errors of the difference for 
this analysis will be .18 and .15 and the significant score magnitudes (at the 3 S.D. 
level), .54 and .45 for the complaint determinants and referents, respectively. All 
raters agree that “reality distortive” complaints are to be considered most severe. 
“Mentational” complaints are next in severity and receive a significantly higher 
mean score than all other determinants except “reality distortive” and “affective”. 
“Welfare” complaints (i.e., asking for aid of a non-psychiatric type) are rated 
significantly less important than all other determinants. Among the referents “mental 
health” and “interpersonal” receive the highest mean scores, both of which are 
significantly higher than “physical health”. “Economic” is rated significantly less 
severe than all the other complaint referents. 

A simple structure solution. Since there are considerable differences among individual 
raters in assessing the relative severity of each class of complaints it was hypothesized 
that there might be other subsidiary dimensions which would affect such 
ratings. To get at these subsidiary dimensions the inter-correlations among all raters 
(see Table 1) were factored by the centroid method. Five factors were extracted and 
rotated by the radial method with a view towards achieving simple structure within a 
positive manifold. Table 6 lists both the centroid and the oblique factor loadings. 
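
The first step of the centroid method can be sketched briefly. This is an illustration under stated assumptions, not a reconstruction of the analysis reported here: the 3 x 3 correlation matrix is hypothetical, unities are placed in the diagonal (the text does not say what communality estimates were used), and the sign reflection needed before extracting later factors, as well as the rotation, are omitted.

```python
import math


def first_centroid_loadings(r):
    """Loadings on the first centroid factor: column sum / sqrt(grand sum)."""
    n = len(r)
    column_sums = [sum(r[i][j] for i in range(n)) for j in range(n)]
    grand_total = sum(column_sums)
    return [s / math.sqrt(grand_total) for s in column_sums]


def residual_matrix(r, loadings):
    """Correlations left after removing the factor's contribution."""
    n = len(r)
    return [[r[i][j] - loadings[i] * loadings[j] for j in range(n)]
            for i in range(n)]


# Hypothetical inter-rater correlations (not values from Table 1).
R = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.4],
    [0.5, 0.4, 1.0],
]

loadings = first_centroid_loadings(R)
residuals = residual_matrix(R, loadings)
print([round(a, 3) for a in loadings])
```

Each column of the residual matrix sums to zero, which is why later centroid factors can only be extracted after some variables have been reflected in sign.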


TABLE 6. CENTROID ANALYSIS OF THE INTER-CORRELATION AMONG ALL RATERS 


Centroid Oblique 
Factor Loadings Factor Loadings 


Psychiatrist 
Psychiatrist 
Psychiatrist 
Psychiatrist 
Psychiatrist 
Psychiatrist 
Psychologist 
Psychologist 
Psychologist 
Psychologist 
Psychologist 
Psychologist 
Social Worker 
Social Worker 
Social Worker 
Social Worker 
Social Worker 
Social Worker 

Factor E seems to collapse upon Factor A, and an attempt at interpretation is there- 
fore made for the first four factors only. 

Factor arrays were computed for the four factors by selecting the three ratings 
which had the highest oblique loading on each factor. All statements were then re- 
ordered in terms of these factor arrays. Table 7 lists the five most extreme statements 
in each factor array. All the arrays contain statements which have “reality 
distortive” aspects. Thus, as already noted, the common aspect of ranking complaint 


TABLE 7. MOST EXTREME STATEMENTS IN TERMS OF THE FACTOR ARRAYS 


Factor A 
1. I have unusual ideas about what is happening to my body. 
2. Someone is trying to harm me. 
3. I am so nervous I can’t work. 
4. I want something done because I am in poor health. 
5. I want to go to a hospital because I am upset. 

Factor B 
1. I believe I have special powers. 
2. I hear or see things no one else does. 
3. I get strange ideas when I get nervous. 
4. I have unusual ideas about what is happening to my body. 
5. Strange things happen to me when I get upset. 

Factor C 
1. I hear or see things no one else does. 
2. Someone is trying to harm me. 
3. I believe I have special powers. 
4. I feel I’m a burden to my family. Maybe they would be better off without me. 
5. I worry about losing my mind. 

Factor D 
1. I am so nervous I can’t work. 
2. I lose jobs because of the way I act. 
3. I have unusual ideas about what is happening to my body. 
4. I have trouble at work because my mind wanders and I get confused. 
5. Someone is trying to harm me. 


statements is in terms of a dimension reaching from reality to irreality, the latter 
being considered the more severe. In terms of the factor arrays we can identify four 
additional dimensions. Factor A seems to reflect that “desire of the patient to be 
taken care of” is the basis of judging severity of complaints. Factor B seems to be 
concerned with the “distortion of the self-concept”; Factor C with the “disruption of 
interpersonal relations”; and Factor D with the “economic problem” associated with 
the respective complaints. 

In the light of the earlier analysis of possible professional differences, it is of 
interest to note that four of the six psychiatrists have significant loadings on Factors 
A and B, while four of the six psychologists have significant loadings on Factor B and 
five of the six on Factor C. This finding might suggest more concern of the psychiatrists 
with the patient’s desire to receive help as a significant variable in a complaint’s 
severity and more concern of the psychologists with the disruption of interpersonal 
ties. Members of all three professions, however, have loadings on all the factors, 
underlining the tentative conclusion that individual value systems rather than 
professional affiliation are of major importance in mediating the judgment of the severity 
of a psychiatric out-patient complaint. 


SUMMARY AND CONCLUSIONS 

A balanced sample of seventy complaint statements typical of patients visiting a 
psychiatric out-patient clinic was selected from a larger sample of complaints culled 
from clinic files. Two sets of raters, each consisting of three psychiatrists, three 
psychologists and three social workers, rank-ordered these complaints in the order 
of their severity using Q-sort methods. 

All complaints were classified in terms of a two-dimensional system of determinants 
and referents. The results of an analysis of variance indicate that there 
were significant differences among individual raters, but that there was more overall 
divergence within each professional group than among individuals from different pro- 
fessional backgrounds. Complaints referring to mental health, interpersonal, or non- 
specific sources were felt to be most significant by all raters, while those referring to 
economic and physical health dimensions were rated least severe. In terms of the 
determinants, complaints of a reality distortive or of mentational nature were held to 
be most severe, while those of a welfare or somatic nature were considered least 
severe. 

The rankings made by the members of each set of raters were correlated and 
factored by the multiple-group method to test the hypothesis that there are unique 
factors associated with the professional identification of the raters which will in- 
fluence their judgment of the severity of psychiatric complaints. No such unique 
factor could be identified. 

In a second factor analysis of the correlations among all the raters, using the 
centroid method, an oblique rotation resulted in four identifiable factors. These were 
named “desire of the patient to be taken care of”, “distortion of the self-concept”, 
“disruption of inter-personal relations” and “economic problems”. Again, members 
of all professional groups show loadings on all factors. 

If there are indeed unique stimulus dimensions operating in the judgment of 
the severity of psychiatric complaints which are related to professional identification, 
then these dimensions should emerge in a factor analysis design such as the one used 
here for any set of raters. The characteristics of a particular rater would affect 
primarily the magnitude of the specific factor loadings but not the simple structure 
solution if such unique dimensions exist. The results suggest that at least for 
samples of complaint statements gathered in a municipal out-patient clinic a solution 
specifying effect of professional identification on the judgment of the severity of 
psychiatric complaints is not parsimonious and that there was more communality 
than divergence among the raters participating in the study. Instead, evidence is 
reported which suggests that such judgment may be mediated by the raters’ 
individual value systems with respect to the nature of the specific complaints. 

THE COMPARATIVE RELIABILITY OF TWO TECHNIQUES OF 
PERSONALITY APPRAISAL¹ 
PAUL HORST AND CALVIN E. WRIGHT 


University of Washington 


One of the traditional objections to the self-appraisal type of noncognitive 
psychological instrument is the tendency for the subject to give what he regards as 
the socially approved response to the items or statements on the inventory. A 
method credited by Baier to the senior author has been used extensively in an 
effort to overcome this objection. It consists essentially of the following steps: 

1. Several or more dimensions are postulated for a given personality, temperament, interest, 
or other noncognitive domain. 
2. Sets of items assumed to be relevant to the respective postulated dimensions are written. 

3. The extent to 
response is determined 
For each item, then, asc 


which agreement with each item is regarded as a socially acceptable 
by some scaling tech ique considered appropriate by the investigator 
ile vi of social desirability is available 

4. Pairs of items are next arranged so that the two items have approximately the same 
social desirability scale value but represent different dimensions. Typically each item is paired 
with the same number of items as is every other item, so that on the final format all items appear 
the same number of times 

5 A subje ct is then require d for each pair of items to indicate which of the two most nearly 
applies to him. The score for each dimension or trait is the number of items from that trait 
the subject. 


( he¢ ked by 
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This procedure has come to be known as the forced choice technique. Well 
known variations of it consist of arranging items in triads or tetrads, rather than in 
duads or pairs. The senior author’s original notion in proposing the technique Was 
that, since the items were matched for social desirability, the subject would be forced 
to choose on the basis of his relative position with respect to the two dimensions 
represented by the pair. He would therefore be responding to the dimensionality or 
trait aspects of the items rather than to what he regarded as socially desirable. 

Another advantage of the forced choice technique is that a relatively small
number of items for each dimension can yield a wide range of scores for that dimension.
For example, if there are nine items for a dimension and each is paired with
three other items, the range of possible scores for a dimension or trait is from 0
through 27. It should, however, be noted that this feature of the technique, while
economical in item writing time, is not economical in test administration time, since
the subject must respond to each member of all of the pairs in the format.
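
The arithmetic of the nine-item example can be sketched as follows. This is only an illustration; the function names and the four-dimension figure used for the administration load are hypothetical and do not come from the text.

```python
def forced_choice_score_range(items_per_dimension, pairings_per_item):
    """Return (lowest, highest) possible score for one dimension.

    Each appearance of an item in a pair gives the subject one chance to
    check it, so the ceiling equals the total number of appearances.
    """
    appearances = items_per_dimension * pairings_per_item
    return 0, appearances


def pairs_to_answer(n_dimensions, items_per_dimension, pairings_per_item):
    """Number of pairs in the format; every pair uses two item appearances."""
    total_appearances = n_dimensions * items_per_dimension * pairings_per_item
    return total_appearances // 2


# Nine items, each paired with three others, as in the text.
low, high = forced_choice_score_range(9, 3)

# Hypothetical format with four dimensions, to show how the load grows.
load = pairs_to_answer(4, 9, 3)
```

The same nine items presented singly would need only nine responses per dimension, which is the administration cost the paragraph refers to.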

A serious defect in the forced choice technique was early recognized by the
senior author and led him to abandon it as generally suitable for noncognitive
psychological measurement. For a properly constructed set of forced choice scales
the sum of a person’s scores for all dimensions or traits considered must be the same
as the sum of the scores for any other person. This results in what Cattell has
called ipsative scores. Scores of this kind can be used to compare a person’s standing
on a given dimension with his standing on another dimension. They cannot be used
to compare his standing on a given dimension with that of another person on the
same dimension. Scores suitable for this latter purpose are known as normative
scores. The characteristics and limitations of ipsative type scores have been developed
in great detail by Clemans. It is true, however, that such scores, even
though appropriate only for intra-individual comparisons, can be of great practical
value. If they can be freed from the factor of social desirability the forced choice
techniques in noncognitive psychological measurement might be justified on this
basis alone.

The question still remains whether methods equally economical of construction
time and more economical of administration time could be developed which would
still tend to suppress the social desirability variable. Now essentially a forced choice
set of scores for an individual may be regarded as a set of normative scores expressed
as deviations from the person’s own mean. The mathematical basis for this method
of regarding ipsative scores can be readily demonstrated. One might, therefore,
construct sets of items for each postulated dimension and present them in conventional
format, scoring them so that a person’s score on a dimension is derived
from his responses to the individual items for that dimension. The scores for each
dimension for a group of subjects could then be appropriately scaled such as by conversion
to standard scores. For each person, then, the standardized measures could
be expressed as deviations from his own mean of standard scores. Such scores might
then again be standardized over persons. This process we will call arithmetical
ipsatization.
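
A minimal sketch of the three steps just described may make the procedure concrete. The raw scores below are invented for illustration, and the use of the population standard deviation is an assumption; the text does not say which form was used.

```python
from statistics import mean, pstdev


def standardize(values):
    """Convert a list of scores to standard scores (mean 0, SD 1)."""
    m, s = mean(values), pstdev(values)
    if s == 0:
        return [0.0 for _ in values]
    return [(v - m) / s for v in values]


def standardize_columns(rows):
    """Standardize each dimension (column) over persons (rows)."""
    columns = [standardize(list(col)) for col in zip(*rows)]
    return [list(person) for person in zip(*columns)]


def arithmetically_ipsatize(raw):
    """raw[p][d] is the raw score of person p on dimension d."""
    z = standardize_columns(raw)                 # step 1: standard scores
    deviations = []
    for row in z:                                # step 2: subtract own mean
        own_mean = mean(row)
        deviations.append([v - own_mean for v in row])
    final = standardize_columns(deviations)      # step 3: restandardize
    return deviations, final


# Hypothetical raw scores: four persons on three dimensions.
raw_scores = [[50, 60, 40], [70, 75, 72], [30, 45, 33], [55, 50, 65]]
deviations, ipsatized = arithmetically_ipsatize(raw_scores)
```

After step 2 every person's scores sum to zero, which is the ipsative property; step 3 only rescales each dimension and leaves that constraint approximately, not exactly, in force.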

Suppose now a set of scores on the same dimensions as those just considered had
been obtained experimentally by means of a forced choice format and expressed in
standard score form. We might regard these scores as comparable to those obtained
as indicated in the preceding paragraph. In both cases we would assume that the
social desirability factor had been largely suppressed. If we assume the same
items were used in both cases, administration time would be much less in the
former procedure than the latter, since each item was not repeated a number of
times with another item.


One objection to the conventional procedure as distinguished from the forced
choice procedure is the possibility that most of the variance in the raw scores obtained
from the non-forced choice format is due to the social desirability variable.
If this is the case the correlation among the dimensions, as measured by the raw
scores, will be high, and the subtraction of each person’s mean from his own scores
may tend to leave a high proportion of error variance in the residual scores. This in
turn would result in low retest reliability of the arithmetically ipsatized scores.

It is therefore reasonable to hypothesize that even though one may suppress the
social desirability factor by the arithmetical ipsatization of scores obtained from a
format administratively much more economical than the forced choice, nevertheless
the former scores might be much less reliable than the latter. This hypothesis can
be readily tested by giving two administrations of both types of formats to the
same group of subjects. In the conventional format one must specify the possible
responses to each item. These may be simply “yes” or “no” or they may include a
series of possible responses varying in degree of agreement, applicability, or what not.
When a range of possible responses is provided for each item we may call the format
a rating scale.

THE EXPERIMENT

The forced choice type of noncognitive self-appraisal instrument chosen for this 
experiment was the Edwards Personal Preference Schedule. It appears to have
been carefully constructed and has been the subject of unusually competent research. 
The EPPS yields scores on 15 different postulated variables. It consists of 225 pairs 
of items or a total of 450 individual items. Each variable is represented by nine 
different items. Eight of the nine are used three times each and one of the nine is 
used four times. The sum of a person’s scores for the 15 variables is 210 while the 
maximum score for any one variable is 28. 
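
The figures in this paragraph can be checked against one another with a few lines of arithmetic. The variable names are illustrative only. Note that the scored total of 210 accounts for 210 of the 225 pairs; the text does not say what the remaining 15 pairs are, so treating them as unscored is an assumption here.

```python
n_variables = 15
items_per_variable = 9
# Eight of the nine items are used three times each, one is used four times.
uses_per_item = [3] * 8 + [4]

distinct_items = n_variables * items_per_variable          # distinct statements
max_score_per_variable = sum(uses_per_item)                 # appearances per variable
item_appearances = n_variables * max_score_per_variable     # appearances overall
scored_pairs = item_appearances // 2                        # two appearances per pair

# Exactly one item is chosen in each scored pair, so every subject's
# scores sum to the number of scored pairs (the ipsative constraint).
sum_of_scores = scored_pairs

total_pairs_in_booklet = 225                                # as stated in the text
pairs_not_in_total = total_pairs_in_booklet - scored_pairs  # assumed unscored
```

The constant sum is what makes the scores ipsative: a high score on one variable must be offset by lower scores elsewhere.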

A rating scale format was made up by presenting in random order the 135 distinct 
items in the EPPS. The subject was required to indicate to what extent each state- 
ment was a good or poor description of himself by marking one of the following 
categories on an answer sheet: As good as possible, very good, quite good, rather 
good, only fair, rather poor, quite poor, very poor, as poor as possible. The possible 
range of scores for each item was from 1 for “as poor as possible” to 9 for “as good
as possible”. The raw score for each variable was the sum of the scores for the items
assigned to that variable. Thus the highest possible score for a variable was 81. 

It should be noted that increasing the range of possible responses to an item
tends to increase the reliability of the item scores and hence the score for a variable
without appreciably increasing the administration time, if at all. As a matter of fact
this latitude of response is doubtless less frustrating to the subject than being forced
to choose categorically between “yes” and “no” in a conventional format or between
two items in a forced choice format.

The EPPS was administered twice to about 150 undergraduate students with 
approximately one week between administrations. The conventional format was ad- 
ministered the day after each EPPS administration. Of all subjects tested, 92 had 
complete data for all four sessions. There were 44 males and 48 females. The average 
age was 21.9 years. The administration time for the EPPS was approximately 
one hour whereas for the conventional format it was only about 20 minutes. 

For the conventional format the item scores were punched from the answer 
sheets into IBM cards. Scores for the 15 variables were computed for both administrations
on the IBM Type 650 by means of a program developed by Wright. Both
sets of rating scale scores were then arithmetically ipsatized to suppress as much as 
possible of the social desirability factor. 


RESULTS 


The retest reliabilities of the arithmetically ipsatized rating scale scores and
those of the EPPS forced choice scores are given in Table 1, together with the
differences between them. It will be noted that, contrary to the hypothesis, for only
one variable, Abasement, was the reliability higher for the EPPS than for the ipsatized
rating scale format and this was only a difference of .01. The greatest difference
was for Exhibition. Here the ipsatized rating scale reliability exceeded the
EPPS by .24. The average reliability for the ipsatized rating scale form was .87 and
TABLE 1. TEST-RETEST RELIABILITY COEFFICIENTS (N = 92)

                    Ipsatized       Edwards
Variable            Rating Scale    PPS          Difference

Achievement         .83
Deference           .67
Order               .81
Exhibition          .64
Autonomy            .76
Affiliation         .80
Intraception        .84
Succorance
Dominance
Abasement           .84
Nurturance
Change
Endurance
Heterosexuality
Aggression

[Only the figures shown survive legibly; their column assignment is uncertain.]

that for the EPPS .78, or an average excess of .09 in favor of the rating scale form. 
This excess in reliability was achieved even though the administration time for the 
rating scale form was only about one-third that of the EPPS. 


SUMMARY 


If one may generalize from these data it appears that any advantage which the 
forced choice type of self-appraisal instrument may have over the arithmetically 
ipsatized rating scale form must be other than that of greater reliability. Other 
possible advantages should be investigated in further research. 
RELIABILITY OF DIAGNOSTIC JUDGMENTS MADE BY
PSYCHOLOGISTS¹
ALLAN GOLDFARB 
City Health Department, Baltimore, Maryland 


PROBLEM 

The diagnosis is a crucial consideration in deciding upon alternative recommend- 
ations for a psychiatric patient. These decisions may involve important consequences 
for a patient, such as the choice of treatment, commitment to an institution, or 
separation from military service. One could be more confident of the appropriate- 
ness of his course of action if the diagnosis given a patient were the same regardless of 
which clinician made the evaluation. The purpose of the present study is to investi- 
gate both inter- and intrajudge agreement in making diagnostic judgments. 


METHOD 

Judges. The four psychologists who served as judges were on the staff of the VA 
Mental Hygiene Clinic in the Baltimore Regional Office. The clinicians were a 
homogeneous group. They had Ph.D. degrees in psychology and all had completed 
the VA Clinical Psychology Training Program. Diagnostic evaluations were a 
routine part of their clinical activities. The Diagnostic and Statistical Manual for
Mental Disorders (1) was used by the group as the criterion for classification of the
patients. For any particular diagnostic problem the clinicians used fairly similar test
batteries, which almost invariably included the Rorschach test. They had in common
similar professional experiences, having worked in this clinic from two to five
years at the time of this study. This included joint participation in staff meetings 
and in-service training programs relating to their diagnostic function. 


Diagnostic Data. The clinical records were those of veterans who had come to the 
VA for assessment of service-connected psychiatric disabilities in relation to pension 
or compensation claims. All had been evaluated by the same psychiatrist, who re- 
ferred them to the psychology section for diagnostic appraisal. These referrals 
typically included the more complex and difficult diagnostic cases. The referrals were 
assigned to the psychologists by the secretary from a roster. These assignments were 
based on the availability of the clinicians and there was no selectivity by kind of 
diagnostic problem to be evaluated. 

Before seeing a patient the clinician generally had available the referral sheet, 
which included the psychiatrist’s provisional diagnosis, and the patient’s military 
record, which described his psychiatric difficulties in the service. The psychologist’s 
report was focused on the patient’s diagnostic evaluation and was rarely more than 
a page long. The report and the test material comprised the basic data for this study. 

Throughout the study the diagnoses given by the psychologists corresponded to
the specific psychiatric conditions termed “reactions”, as defined in the manual (1).
For every hypothesis tested these specific diagnoses were grouped by the E into their
relevant “disorder” categories, which consisted of groups of related psychiatric syndromes.
Consequently all statistical analyses reported in this study were based on
the specific diagnoses grouped in the five categories of the psychophysiologic, psychoneurotic,
psychotic, personality, and brain damage disorders. The manner of recording
diagnoses (1) was reviewed at the start of the study. This was done to
insure that the diagnoses given by the judges were consistent in describing the
patient’s psychiatric conditions in terms of reactions, in contrast to formulating
complementary diagnostic data. None of the data were categorized or statistically


¹The author was a staff member in the VA Mental Hygiene Clinic, Baltimore, Maryland, at the
time this study was conducted. Credit is due Dr. Morris Roseman, Chief, Psychological Services, and
Dr. James P. O’Connor, VA Consultant, for their encouragement and aid with the design and completion
of this study. The statements and conclusions of the author do not necessarily represent those
of the VA.
analyzed until the entire study was completed, so as to minimize any bias in the
evaluations of the E, who served as Judge A. In addition, Judge A completed his
evaluations prior to those made by the other clinicians.


INTERJUDGE AGREEMENT 
The data consist of diagnostic judgments made by the clinicians on (a) groups
of randomly assigned patients; (b) the same group of modified clinical reports; and
(c) the individual modified reports.


Procedure and Results. (a) Groups of Randomly Assigned Patients. To obtain a
sufficient number of cases all evaluations done by the four psychologists between
July 1955 and November 1956 were collected from the files. For the group of judges
there were from 51 to 71 cases. There were no statistically significant differences in
age or education for the four patient subsamples. For each judge the specific
diagnoses listed in all his clinical reports were grouped by the E under the appropriate
diagnostic category. As indicated in Table 1, Chi Square analysis reveals a


TABLE 1. CLASSIFICATION OF DIAGNOSES ORIGINALLY MADE BY EACH JUDGE ON HIS OWN CASES*

Diagnostic Category     Judge

Psychophysiologic
Psychoneurotic
Psychotic
Personality
Brain damage
Total

[Cell entries not legible.]
*Chi Square = 62.61


very marked difference in the diagnostic percent distributions for the judges.
These findings were suggestive of considerable interjudge variation beyond chance
expectations.
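
The kind of Chi Square analysis referred to above can be sketched as follows. The counts are entirely hypothetical, since the cell entries of Table 1 are not available here, and the function is a plain textbook test of homogeneity rather than a record of the author's computation.

```python
def chi_square(table):
    """Chi Square statistic and degrees of freedom for a contingency table.

    table[i][j] is the number of cases placed in category i by judge j.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# Hypothetical counts: five diagnostic categories (rows) by four judges (columns).
hypothetical_counts = [
    [5, 4, 6, 3],      # psychophysiologic
    [30, 15, 20, 10],  # psychoneurotic
    [10, 12, 9, 11],   # psychotic
    [8, 25, 15, 30],   # personality
    [3, 4, 5, 2],      # brain damage
]
statistic, df = chi_square(hypothetical_counts)
```

With five categories and four judges the test has 12 degrees of freedom. A large statistic means the judges distribute their cases over the categories in different proportions; it says nothing about whether they agree on any individual case.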
(b) The Same Group of Modified Clinical Reports. The diagnostic data consisted
of 100 clinical reports. Included were 25 reports from each of the judges’ original
cases selected by using a table of random numbers. Before the reports were judged
they were modified by removing identifying data, which consisted of the patient’s
name, referral question, diagnosis, and the clinician’s name. A member of the staff
who did not participate in making the judgments selected the 100 reports and also
deleted the specified data from them.
The psychologists were asked to make a diagnostic judgment for each of the 100
reports. This was accomplished in three sessions approximately a week apart. Each
clinician made his evaluations independently of the others. All the diagnoses given
by each judge were classified by the E in the relevant diagnostic category. The results
of this procedure are presented in Table 2. Analysis of the data indicated
marked interjudge variability beyond chance expectations.

TABLE 2. CLASSIFICATION AND NUMBER OF DIAGNOSES MADE BY EACH JUDGE
ON THE SAME 100 MODIFIED CLINICAL REPORTS*

Diagnostic Category     Judge

Psychophysiologic
Psychoneurotic
Psychotic
Personality
Brain damage
Total

[Cell entries and Chi Square value not legible.]
*Chi Square
For each judge a comparison was then made of the diagnostic percentage distributions
obtained from personal contact with patients (Table 1) and from judging
the modified clinical reports with his own original 25 cases excluded. The question 
raised here is whether or not there is the same kind of significant difference in the 
diagnostic percentage distributions obtained from the judges, when the evaluations 
are made from two different types of data. Chi Square analysis indicated a statis- 
tically significant change for Judge A only (Chi Square = 10; p <.05). Judges B, 
C, and D consistently assigned approximately the same proportion of cases to each 
of the diagnostic categories in both situations. Judge A transferred a large group of 
his cases from the neurotic to the personality category in the second judging situa- 
tion. The findings indicated that the clinicians assigned significantly different pro- 
portions of patients to the various diagnostic categories and, in addition, three of the 
four judges consistently applied these percentages when dealing with different diagnostic
materials.

(c) The Individual Modified Reports. Since all four judges had made a diag- 
nostic judgment on the identical 100 modified reports, it was possible to get a measure 
of interjudge agreement by first comparing all possible pairs of clinicians on the same 
cases. This was done by counting the number of times any two specified judges 
agreed by diagnostic category for each of the 100 cases. Agreement between all 
possible pairs of judges expressed as a percentage was: 63% for A with C; 62% for A 
with D; 60% for A with B; 58% for C with B; 58% for D with B; and 53% for C 
with D. The narrow range of these percentages of agreement suggested that no
one judge differed markedly from the others. This indicated that the judges were
relatively homogeneous in making these evaluations.
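
The pairwise counting procedure described above can be sketched in a few lines. The four-case data set is invented purely to show the computation; only the six percentages at the end are taken from the text.

```python
from itertools import combinations


def pairwise_agreement(diagnoses):
    """Percent of cases on which each pair of judges gave the same category.

    diagnoses maps a judge's name to his list of category labels,
    one label per case, with cases in the same order for every judge.
    """
    result = {}
    for a, b in combinations(sorted(diagnoses), 2):
        matches = sum(x == y for x, y in zip(diagnoses[a], diagnoses[b]))
        result[(a, b)] = 100.0 * matches / len(diagnoses[a])
    return result


# Hypothetical judgments on four cases by three judges.
toy = {
    "A": ["neurotic", "psychotic", "personality", "neurotic"],
    "B": ["neurotic", "psychotic", "neurotic", "neurotic"],
    "C": ["personality", "psychotic", "personality", "neurotic"],
}
toy_agreement = pairwise_agreement(toy)

# The six percentages reported in the text, and their mean.
reported = [63, 62, 60, 58, 58, 53]
mean_reported = sum(reported) / len(reported)
```

The mean of the six reported figures is 59, which is the basis for speaking of agreement on the order of 60%.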

The findings indicated that any two of the clinicians agreed on a diagnosis approximately
60% of the time when evaluating the modified reports. The question is
raised as to whether or not this 60% represents a core of cases which would reflect
simultaneous agreement by all four judges. In an effort to examine this further,
agreement of all four judges on the same 100 cases was studied. The results in terms
of the interjudge agreement by diagnostic category of the evaluations made on the
same 100 reports were: 32% of the cases reflected unanimous agreement by all four
judges; in 40% of the cases there was agreement by three of the four judges; in 15%
of the cases two judges agreed on one diagnostic category, while the other two judges
agreed on a different diagnostic category; in 12% of the cases two of the judges
agreed on a diagnostic category and the other two judges disagreed; and with one
case there was complete disagreement. These findings suggest that 60% of the cases
does not represent the minimum number with which there would be agreement
between all four judges.
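
The five outcomes listed above correspond to the ways four votes can split, and the reported percentages can be checked against the pairwise figure of about 60%. The sketch below is illustrative; the function names are not from the source.

```python
from collections import Counter


def agreement_pattern(labels):
    """Vote split for one case, largest block first, e.g. (3, 1) or (2, 1, 1)."""
    return tuple(sorted(Counter(labels).values(), reverse=True))


def agreeing_pairs(pattern):
    """Number of judge pairs in agreement for a given vote split."""
    return sum(k * (k - 1) // 2 for k in pattern)


# Percent of the 100 cases showing each split, as reported in the text.
reported = {
    (4,): 32,          # unanimous
    (3, 1): 40,        # three of four agree
    (2, 2): 15,        # two agree on one category, two on another
    (2, 1, 1): 12,     # two agree, the other two differ from them and each other
    (1, 1, 1, 1): 1,   # complete disagreement
}

pairs_per_case = 6  # four judges taken two at a time
total_agreeing = sum(agreeing_pairs(p) * n for p, n in reported.items())
implied_pairwise_percent = 100 * total_agreeing / (pairs_per_case * 100)
```

The implied mean pairwise agreement works out to 59%, the same as the mean of the six pairwise percentages given earlier in the text, while only 32% of the cases are unanimous. This is why pairwise agreement near 60% does not imply a core of 60% of cases on which all four judges agree.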


INTRAJUDGE AGREEMENT 


The data include repeated diagnostic judgments made by each psychologist on 
(a) clinical reports; and (b) test data which he had initially completed. 
Procedure and Results. (a) Clinical Reports. Intrajudge repeatability was deter- 
mined by counting the number of perfect agreements between the diagnoses grouped 
by categories made by each of the judges on his own original 25 reports and the same 
reports with identifying data deleted, and dividing by 25. The percentage of agree- 
ment between the repeated evaluations for each judge was: 68% for Judge D; 64% 
for Judge A; 64% for Judge B; and 52% for Judge C. There was no statistically 
significant difference in the number of perfect agreements for the judges as measured 
by the Chi Square test. The average agreement for all the judges was approximately 
60%. These findings suggested that each judge was equally consistent in repeating 
his own evaluations. 

(b) Test Data. Each clinician was presented with the test data of 25 cases he had 
worked-up originally and was asked to make a diagnosis. Approximately two weeks 
later each judge was asked to repeat his evaluation of the same test data. The 
judges made their evaluations independently of the others. The percentage of agreement
between the repeated judgments grouped by categories for each clinician on the
same test data was: 64% for Judge A; 60% for Judge B; 60% for Judge C; and 56%
for Judge D. Inspection of these percentages suggests that there are no striking
differences among the judges. The average agreement was 60%. One concludes that
the judges were equally consistent in repeating their evaluations based on test data.
It is to be noted that these findings are about the same as those determined in the 
situation where the judgments were made solely on the clinical reports. 


DISCUSSION 


The judges differed significantly in the proportions of their diagnostic evaluations
grouped by categories, and in addition, tended to consistently apply these proportions.
This finding may be explained in several ways: (a) the clinicians may have
consistently focused on different facets, symptoms, or levels of patient behavior in
deriving a diagnosis according to their theoretical persuasion and experience, i.e.,
there is no one “right” diagnosis; (b) they may have evaluated the same diagnostic
features, but repeatedly used the diagnostic nomenclature differently; and (c) the
judges may have had preconceived proportions by which they typically grouped the
patient population.

However, when one considers that inter- and intrajudge agreement was on the
order of 60%, one may question the stability of the diagnostic criteria used by the
clinicians in reaching their evaluations.

In assessing the reliability findings derived from the psychologists’ evaluations
of the clinical reports which had identifying information deleted, and the test data,
a consideration is that the judges did not have face-to-face contact with
patients. It should be noted that both these research situations are essentially artificial
in that clinicians do not make a diagnosis in this manner in their clinical
practice. This leaves unanswered the question of the possible influence that direct
contact with patients might have had on the reliability of these judgments made by
the clinicians. However, the finding of a significant difference among the clinicians
in the proportions of their diagnostic evaluations grouped by categories, the clinical
data having been obtained by direct contact with patients, suggests the possibility
that the marked variation in the diagnostic judgments derived from both the modified
clinical report and the test data might not have been materially lessened if the
judges had also interviewed the patients. The results of this study indicate that a
satisfactory degree of reliability of diagnostic judgments was not consistently obtained
in either the realistic clinical situation or in the basically artificial ones.

The seeming discrepancy between the significant differences in the proportions
of cases classified by diagnostic category and judge repeatability of the magnitude
of 60% is very likely a reflection of what might be termed the classification of core
versus non-core cases. A core case can be considered one in which the judges agree
on its diagnosis; the opposite situation would prevail in varying degrees with the
non-core cases. It is very probable that with certain clinical manifestations there will
be complete judge agreement on the diagnosis, while with other kinds of cases the
judges would be more prone to disagree on the diagnosis. An example of the former
situation might involve classifying the patient who is delusional and hallucinating,
while in the latter situation a patient who presents a less striking symptomatic picture
would more probably be diagnosed differently by the clinicians. The data in
Table 1 support this hypothesis, since marked diagnostic differences among the
judges occur in the personality and in the psychoneurotic classifications. In terms of
this possible explanation the apparent discrepancy is due to the judges’ having
agreed on the core cases, but the non-core cases are proportionately large enough in
number and diagnosed sufficiently differently as to significantly influence the statistical
distribution of all the cases evaluated. As suggested by the data derived from the
judges’ evaluations of the same 100 modified clinical reports, the core of cases agreed
upon by the four judges is considerably less than 60% of all the cases.
This study attempts to assess the reliability of the final diagnostic judgments of
clinicians. It would seem appropriate in terms of future research to study those
factors which initially influence diagnostic decision-making by psychologists. The
factor of direct patient contact would seem particularly crucial to evaluate further.
It would also be of value to clarify the characteristics of core and non-core cases.

The fundamental question raised in this study was whether or not the diagnoses
given to groups of patients depend to a significant degree upon which clinician
made the evaluation. The results suggest that the answer to this question is sub-


SUMMARY 


This study attempted to evaluate inter- and intrajudge agreement in making
diagnostic judgments. The two general hypotheses investigated were: (a) the judges
agree in their diagnostic evaluations; and (b) each judge is consistent in making repeated
diagnostic evaluations. Experimental procedures to test these hypotheses
were described. The results indicated that there was significant variability among the
judges in the proportions of their diagnostic evaluations grouped by categories. The
clinicians also tended to apply these proportions in a consistent manner. Interjudge
agreement based on evaluating modified clinical reports was on the order of 60%.
Intrajudge agreement based on making repeated evaluations of the same test data
was also approximately 60%. The core of cases with which there is unanimous agreement
by the judges is apparently considerably less than 60% of all the cases of the
type evaluated in this study. The overall finding of this study suggests that the
diagnoses given to groups of patients are in large measure dependent on which clinician
makes the evaluations. Suggestions for future research are indicated.
REFERENCES

1. American Psychiatric Association, Committee on Nomenclature and Statistics. Diagnostic
and statistical manual for mental disorders. Washington: American Psychiatric Association, Mental
Hospital Service, 1952.


THE VALIDITY OF CLINICAL JUDGMENTS OF SCHIZOPHRENIC
PATHOLOGY BASED ON VERBAL RESPONSES TO
INTELLIGENCE TEST ITEMS¹

NELSON F. JONES, JR. 

Northwestern University 
The reliability of clinical judgment has been the subject of a series of studies
using single responses to intelligence test items obtained from psychiatric patients,
and judgments by trained clinicians based on a seven-point rating scale. It has been
possible to demonstrate reliability in judgments of disorganization in schizophrenic
thinking as well as more specific aspects of schizophrenic thinking.


The question now arises whether clinicians can make global appraisals of complete
test protocols with as great reliability as they do individual test items. This
paper extends the study of reliability, defined as interjudge agreement, to global


¹This study is part of a larger project being conducted at Northwestern University under the
direction of Professor William A. Hunt through contract 7onr-450(11) with the Office of Naval Research.
Facilities for collecting the experimental materials and establishing the criterion were provided
by the VA Hospital, Hines, Illinois. The opinions expressed, however, are those of the author
and do not represent opinions or policy of the Naval service nor of the Veterans Administration. The
study was submitted to the Graduate School of Northwestern University in partial fulfillment of the
requirements for the Ph.D. degree. The author wishes to thank Dr. W. A. Hunt, Dr. R. I. Watson,
Dr. E. L. Clark, and Dr. Janet A. Taylor for their guidance and assistance.
judgments in which whole protocols are ranked according to severity of schizophrenic
pathology. It also introduces a validity measure. The validity of the judgments
The following instructions were given to all experimental judges:

We are going to present you with a number of sub-test protocols taken from intelligence
scales. One of the ways in which the pathology of schizophrenia may express itself is through
disordered thinking which results in atypical, deviant or “abnormal” responses to the items
on such a test. The qualitative interpretation by the clinician of such test responses is one of
the bases upon which he makes a clinical or diagnostic interpretation. All the individuals
who contributed these protocols were diagnosed as schizophrenic. None of the cases was complicated
by mental deficiency or known organic conditions. The extent of the pathology
exhibited, however, is not uniform. In some cases it is less extreme than in others.

We are going to ask you to rank these protocols according to the severity of the pathology
exhibited in the responses and to indicate on the attached form the order in which you rank
them. In making these rankings, we are asking you to concentrate upon the severity of
the pathology exhibited in the responses. We are not interested in such things as prognosis,
chronicity, therapeutic indications, or sub-type designation. In essence, what we are asking
you to do is judge how “schizophrenic” each of these records is in relation to the others presented
with it.

Inter-judge reliability within each condition was determined by correlating each judge’s 
ranking with the mean ranking of the other fifteen members of his group, using Spearman’s Rho. 
Validity coefficients for each judge were determined by comparing his ranking to the criterion 
ranking, again using the Spearman Rho.
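
The reliability measure just described can be sketched as below. The paper does not say how tied mean ranks were handled; assigning average ranks and then taking the product-moment correlation of the ranks is an assumption made here, as are all function names and the small example.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1 upward, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Product-moment correlation; undefined if either list is constant."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den


def spearman_rho(x, y):
    """Spearman's Rho computed as the correlation between the two sets of ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def judge_reliability(rankings, judge):
    """Rho between one judge's ranking and the mean ranking of the other judges."""
    others = [r for name, r in rankings.items() if name != judge]
    mean_ranking = [mean(col) for col in zip(*others)]
    return spearman_rho(rankings[judge], mean_ranking)


# Hypothetical rankings of five protocols by three judges.
example = {"J1": [1, 2, 3, 4, 5], "J2": [2, 1, 4, 3, 5], "J3": [1, 3, 2, 4, 5]}
rho_j1 = judge_reliability(example, "J1")
```

A validity coefficient in the sense of the text would be obtained in the same way by calling `spearman_rho` on a judge's ranking and the criterion ranking.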


RESULTS 
Criterion selection. Although each criterion judge was acquainted to a greater 
extent with some patients than with others prior to the time at which the criterion 
was established, a fact which would be expected to lower agreement on the ratings, 
inter-judge reliability was quite high. Table 1 summarizes the intercorrelations be- 


TABLE 1. SUMMARY OF CRITERION CORRELATIONS

Material Correlated
and Statistic Used                          1 and 2

Interview Ratings:
  r on 12 patients
Test-History Ratings:
  r on 12 patients
Individual Rankings:
  Rho on 12 patients

Agreement of Each Judge with
  Conference Ranking: Rho on
  12 patients                               .916**

[Other coefficients not legible.]
*Significant at the .05 level of confidence
**Significant at the .01 level of confidence


tween judges in establishing the criterion. Ratings based on the interviews and on
the test and history materials were absolute judgments based on a seven-point scale
and were compared by the Pearson r correlation. Independent rankings of the
twelve patients were compared using Spearman’s Rho, as were each judge’s rankings
and the conference results. From the ranking of the twelve patients established in
the conference, the judges chose the patients ranked “1”, “3”, “5”, “7”, “8”, “10”,
and “12” as most clearly discriminable.

Six months later a follow-up was made of the hospital status of these patients.
Their status then is listed below in order of their ranking. “1” Discharged, maximum
hospital benefits; “3” Discharged on trial visit; “5” Discharged on trial visit, in
out-patient treatment; “7” Transferred for long-term hospitalization elsewhere;
“8” Transferred for long-term hospitalization elsewhere; “10” Discharged on trial
visit; “12” Transferred for long-term hospitalization elsewhere.


Experimental results. Using a z-score transformation of the individual reliability
and validity Rhos, a mean correlation was determined for each of the six experimental
groups for each measure. This information appears in Table 2. It is clear that the
mean reliability of even the poorest group is quite high, indicating that the task can
be accomplished reliably even by the group of inexperienced judges who received
the largest amount of material, while the group of experienced clinicians using the
smallest amount of material agree almost entirely in their judgments. These results
agree with those discussed by Hamlin.


TABLE 2. MEAN RELIABILITY AND VALIDITY COEFFICIENTS FOR EACH
EXPERIMENTAL GROUP OF 16 JUDGES

                     Partial        Whole          Whole Vocabulary
Condition            Vocabulary     Vocabulary     and Comprehension

Clinicians
  Reliability        .955           .926           .862
  Validity           .686           .611           .711
Undergraduates
  Reliability        .834           .867           .762
  Validity           .446           .130           .485
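
The averaging of correlations through a z-score transformation, mentioned above, can be sketched as follows. This is a minimal illustration that assumes the transformation was Fisher's z (the inverse hyperbolic tangent), which the text does not state explicitly; the function name and the example Rhos are hypothetical and are not the study's data.

```python
import math


def mean_correlation(rhos):
    """Average correlations via Fisher's z (assumed): z = atanh(r), then tanh(mean z)."""
    if not rhos:
        raise ValueError("need at least one correlation")
    zs = [math.atanh(r) for r in rhos]  # transform each coefficient to z
    return math.tanh(sum(zs) / len(zs))  # back-transform the mean z to a correlation


# Hypothetical individual Rhos for one group of judges (not taken from the study).
example_rhos = [0.96, 0.89, 0.93, 0.75]
print(round(mean_correlation(example_rhos), 3))
```

Averaging in the z metric gives more weight to high coefficients than a plain arithmetic mean of the Rhos would, which is the usual reason for the transformation.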

For validity, we find evidence almost as clear as that for reliability. The significance
of the validity measures is indicated further by the fact that all forty-eight
of the clinicians’ individual validity coefficients fell in the predicted
direction of a positive correlation with the criterion, and forty-two of the forty-eight
undergraduates’ correlations were also positive. Twenty-one clinicians exceeded the
.05 level of confidence in their ratings, and nine undergraduates did so. While experience
and training make the clinician a better judge, it is obvious that this task is
being performed at a level more accurate than would be expected by chance.

Differences in reliability and validity attributable to experience and training or
to the amount of material used in making these judgments were tested for significance
by using the individual reliability and validity coefficients, as converted to z-scores,
in analyses of variance. The six combinations of experience and amount of material
presented were the conditions compared. Hartley’s test for homogeneity of variance
gave no indication that analysis of variance was inappropriate.


TABLE 3. I. ANALYSIS OF VARIANCE OF RELIABILITY COEFFICIENTS

Source of Variation      Sum of Squares     df

Experience                   50,738           1
Amount of Material           34,642           2
Interaction                   6,445           2
Between Groups               91,825           5
Within Groups               213,139          90
Total                       304,964          95


II. ANALYSIS OF VARIANCE OF VALIDITY COEFFICIENTS

Source of Variation      Sum of Squares     df     Mean Square

Experience                   25,058           1       25,058
Amount of Material            2,409           2        1,204
Interaction                     568           2          284
Between Groups               28,035           5        5,607
Within Groups               109,761          90        1,219
Total                       137,796          95


**Significant at the .01 level of confidence.
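
As a worked check on the validity analysis, the F-ratios can be recomputed from the sums of squares. This is only a sketch: the degrees of freedom other than the single df for Experience are not legible in the table and are assumed here from the design (two levels of experience, three amounts of material, sixteen judges per group), and the variable and function names are illustrative.

```python
# Sums of squares and degrees of freedom for the validity analysis (Table 3, part II).
# The df values for Amount of Material, Interaction, and Within Groups are assumptions
# derived from the 2 x 3 design with 16 judges per cell.
validity_effects = {
    "Experience": (25058, 1),
    "Amount of Material": (2409, 2),
    "Interaction": (568, 2),
}
within_ss, within_df = 109761, 90


def f_ratios(effects, error_ss, error_df):
    """Return F = (SS_effect / df_effect) / (SS_error / df_error) for each effect."""
    ms_error = error_ss / error_df
    return {name: (ss / df) / ms_error for name, (ss, df) in effects.items()}


ratios = f_ratios(validity_effects, within_ss, within_df)
for name, f in ratios.items():
    print(f"{name}: F = {f:.2f}")
```

Only the Experience ratio is large; the ratios for Amount of Material and for Interaction fall below 1, which agrees with the verbal report that neither reached significance for validity.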


The difference in performance between clinicians and undergraduates indicates
a clearly significant increase in reliability as a result of increased training and experience.
The increase in amount of material, however, results in a significant decrease
in reliability. The non-significant interaction effect between experience and amount
of material indicates that, contrary to expectation, experienced clinicians are not
relatively better than undergraduates at agreeing with each other when increased
amounts of material are presented to them.










In the case of validity, the clinicians again excel over the undergraduates.
However, increasing the amount of material to be judged did not produce a significant
change, and the interaction effect is again non-significant.

It was possible to separate the clinical judges by level of previous experience on
our prior rating studies and to test, by analysis of variance, the differences between
the three levels of prior experience. No significant differences in either validity or
reliability were found which could be attributable to differing amounts of experience
with the previous tasks, showing such practice effects to be less important here than
in a previous study of context effects.

While both clinicians and undergraduates agreed with the criterion at a better
than chance level, none of the groups’ mean judgments agreed exactly with the
criterion ranking of the patients. Distribution of the percentages of agreement between
the 112 individual judgments made by each group and the criterion ranking,
while another way of presenting the same trend shown in Table 2, indicated that as
the magnitude of error in a single case increases, the likelihood of that error being
made rapidly decreases in this sample of judgments. This, of course, indicates a high
probability of accuracy in the clinical situation [remainder of sentence illegible].


SUMMARY 

Forty-eight trained clinicians and forty-eight undergraduates ranked three
amounts of Vocabulary and Comprehension sub-test responses according to how
schizophrenic they felt the person contributing the protocol to be. Interjudge reliability
and correspondence to an external validity measure were determined. Both
clinicians and undergraduates were more reliable and more valid than chance
expectations. The clinicians were significantly more reliable and closer to the
validating criterion than were the undergraduates. As the amount of material presented
to be judged increased, reliability decreased for both clinicians and undergraduates,
but validity was not affected. Clinicians were relatively no better at
using increased amounts of material than were the undergraduates.

A COMPARISON OF RANDOM AND JUDGMENTAL METHODS OF
DETERMINING MODE OF OUTPATIENT MENTAL
HYGIENE TREATMENT
FLOYD K. GARETZ¹, RICHARD C. KOGL², AND DANIEL N. WIENER³

PROBLEM

Much has been written about ways of deciding how to treat patients with
emotional problems. The best way of arriving at a clinical judgment for treatment is
presumably provided by the “team”, consisting of psychiatrist, psychologist, and
social worker, each contributing his specialized knowledge to the decision.
In a recent study, Wiener and Raths questioned the efficacy of clinical judgment
in deciding upon the best form of psychotherapy. They found that each of the three
disciplines on the team (psychiatry, psychology, and social work) was about equally
accurate in predicting what the therapist would, at the conclusion of treatment,
describe as the preferred method of therapy. No group was very good, however, nor
did meeting together as a team improve the predictions significantly.

To us, the usefulness of clinical judgment as a way of deciding on the form of 
psychotherapy was strongly suspect. We decided to try to compare its relationship 
to success of treatment, with the success following assignment of cases to a particular 
therapy randomly. 

METHOD 

Twenty-seven male white veterans applying for treatment at a mental hygiene
clinic were assigned either to a “clinical judgment” group (Group I), or a “random
number” group (Group II). In Group I, 13 patients were prescribed and received
drug therapy or psychotherapy as determined by the best judgment of the clinic
team and the psychotherapist. The 14 patients in Group II were assigned to either
drug therapy or psychotherapy from a random number table, and the therapist was
bound to this “artificial” decision.

Two psychiatrists treated all the patients. They chose drugs at their own dis- 
cretion, and saw ‘‘drug therapy” patients for no more than 15 minutes each, twice a 
month. Psychotherapy patients were seen weekly for 30 to 50 minutes. After three 
months of treatment, the therapists rated all patients on a five-point scale of treatment
success ranging from “worse” to “markedly improved.”


RESULTS 

The data were analyzed in a fourfold table: 14 randomly-assigned cases, and 13
cases assigned through clinical judgment, each subdivided into “improved” and
“unimproved” groups. “Unimproved” included the “unchanged” and “worse”, and
“improved” included those showing any degree of improvement. Three cases of each
group were unimproved; 11 and 10 of the “random” and “clinical judgment” groups
respectively were improved. The difference was not significant. Because of the small
number of patients, we were not able to subdivide our results (as we had planned)
according to form of treatment. Nor could we readily generalize from such sparse
data.
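
The fourfold table described above can be checked with an exact test. The sketch below assumes Fisher's exact test (two-sided); the report does not say which significance test the authors applied, and the function name is illustrative.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2 x 2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(x):
        # Hypergeometric probability of x cases in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    return sum(
        prob(x) for x in range(lowest, highest + 1) if prob(x) <= p_observed * (1 + 1e-9)
    )


# Rows: random assignment, clinical judgment. Columns: improved, unimproved.
p_value = fisher_exact_two_sided(11, 3, 10, 3)
print(f"two-sided p = {p_value:.3f}")
```

With these counts the observed table is the single most probable one given the margins, so the two-sided p-value is essentially 1, in line with the statement that the difference was not significant.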


SUMMARY AND CONCLUSIONS 
Our initial hypothesis was that current methods attempting to match patient
and mode of outpatient therapy by clinical judgment are of questionable validity.
We studied the results of assigning patients to therapy randomly, as compared with
clinical judgment, and found the improvement rates to be not significantly different. We
concluded from our data, considered with the other results cited, that more caution
should be exercised in current methods of deciding clinically which patient should
have any specified form of outpatient therapy.


¹Psychiatric resident, VA Hospital, Minneapolis, Minn.
²Psychiatrist, private practice, Minneapolis and VA Mental Hygiene Clinic, St. Paul, Minn.
³Chief Clinical Psychologist, VA Mental Hygiene Clinic, St. Paul and University of Minnesota.

A COMPARISON OF PARENTAL AND SELF-EVALUATIONS OF
PSYCHOPATHOLOGY IN CHILDREN¹
EUGENE E. LEVITT 


Indiana University Medical Center 


The mother of a child guidance clinic patient is usually considered to be a
primary source of information about the child, even though it is thought that mothers
frequently have a distorted perception of their children. Nevertheless, the mother’s
statement must often be accepted as valid in the absence of contradictory informa- 
tion. It is a sort of ‘“‘best estimate’’ of the child when no other estimate is available. 
Despite the prevalence of the belief in the fallibility of the mother’s perception, there 
have been few experimental efforts to check it. The job of rounding up estimates of 
the child other than that of the mother is a tedious and time consuming one. Even 
if it could be accomplished reliably, there is always the possibility that differences 
in the perception of the child by the mother or teacher are a function of actual differ- 
ences in the child’s behavior in two different settings. By the time the clinician ob- 
tains sufficient data through observation of the child to check the mother’s report, 
the contacts with the clinic may have changed the child appreciably. 

Helper, in one of the few studies bearing on the question, found low, positive 
correlations between self-evaluations of a group of 8th and 9th grade children and 
evaluations of them by their parents. 

This experiment attempts to check the mother’s perception of the emotionally 
disturbed child. 

SUBJECTS AND PROCEDURE 

The Children’s Manifest Anxiety Scale (CMAS) was administered to 73 consecutive
children between the ages of 8 and 13, who were seen diagnostically at a
large community child guidance clinic. In view of the high incidence of reading deficiencies
among the clinic’s patients, each item in the scale was read to the child by
the psychologist and his responses recorded.² Simultaneously, an identical form of
the test, except that the pronouns “I”, “my”, etc., were replaced by “my child”,
“his”, etc., was administered by the social worker to the mother of each child. The
mothers were told that they could omit an item if they lacked sufficient information
with which to respond to it. In this way, the child and his or her mother described
the child’s symptoms by means of an identical questionnaire.

¹The experimental work summarized in this paper was supported by the Institute for Juvenile
Research of Chicago, an agency of the Department of Public Welfare of the State of Illinois. It was
read at the 1959 meeting of the Society for Research in Child Development in Bethesda, Md.

²Wirt and Broen found that individual and group administrations of the CMAS yield similar
results.

RESULTS

The mean score on the 42 anxiety items (the A-Scale) for the children’s responses
was 20.82 with a SD of 8.14, practically identical with the mean of 20.96 and SD of
7.90 obtained for an earlier sample of a hundred cases from the same clinic, but
significantly higher than the means reported for several different samples of normal
children of comparable ages.

The mean score for the mothers was 18.82, SD 7.47. The difference between 
means, analyzed by an appropriate t-test for correlated means, is not significant. 
Furthermore, if we prorate the mother’s scores for omitted items, the mean becomes 
20.19 with a SD of 7.76, practically identical with that of the children. A similar 
lack of difference prevails for the 11 Lie Scale items. The mean for the children was 
4.07, SD 2.27. For the mothers, the mean was 3.30, SD 2.44. The t of 1.91 is not
significant. Prorating for omitted items, the mean for the mothers becomes 3.44 with 
a SD of 2.39. We see, then, that on the average, the children and their mothers 
perceived the children as disturbed to the same degree. However, the correlation 
between scores for pairs of mothers and children on the A-Scale unprorated is .14, 
prorated, .09. The respective figures for the Lie Scale are —.07 and —.02. None of 
these coefficients is significant. 

Considering the degree of agreement item by item for all items between pairs of 
mothers and children, not counting items omitted by the mother, we find that the 
agreement is 56.69% which does not differ significantly from the chance expectancy 
of 50% (t = 1.15). The agreement on the A-Scale items alone is 57.53%, on the Lie
Scale items, 56.46%, a nonsignificant difference.
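
The item-by-item agreement described above can be sketched as follows. This is a minimal illustration: the responses shown are hypothetical, `None` is an assumed way of marking an item the mother omitted, and the function name is not from the source.

```python
def percent_agreement(child_answers, mother_answers):
    """Percentage of items answered identically, skipping items the mother omitted."""
    answered = [
        (child, mother)
        for child, mother in zip(child_answers, mother_answers)
        if mother is not None  # omitted items are not counted
    ]
    if not answered:
        raise ValueError("the mother answered no items")
    agreements = sum(1 for child, mother in answered if child == mother)
    return 100.0 * agreements / len(answered)


# Hypothetical yes/no responses to six items (not the study's data).
child = [True, False, True, True, False, True]
mother = [True, True, None, True, False, False]
print(percent_agreement(child, mother))  # 3 of the 5 answered items agree: 60.0
```

With two response alternatives per item, agreement of 50% is what would be expected by chance, which is the baseline used in the comparison above.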

If we count one for each item agreement and zero for each non-agreement and
then compute an items × subjects analysis of variance, we find significant F-ratios for
items on both the A-Scale and the Lie Scale (the respective F’s are 4.23 and 2.69, both
significant at the .01 level). This suggests the importance of noting those items on
which there was a greater than chance agreement. There were 10 such items on the
A-Scale and three on the Lie Scale. The agreement on these 13 items ranges from 62
to 73%. The 10 A-Scale items do not seem to cluster in any logical fashion.³ Four
seem to reflect inner states, two are somatic symptoms, three are other observable
states, and one concerns school work.


DISCUSSION 

Children and their mothers agree on the average degree of psychopathology 
among the children, and this average agrees with what has been found previously for 
disturbed children. However, the lack of correlation between the scores of mothers 
and their children, and the lack of item by item agreement, means that the in- 
dividual mother on the average perceived her child’s symptoms differently than he 
did. Since the clinic is usually concerned with the nature of symptoms, we should 
attempt to determine which one is perceiving realistically, mother or child. 

The implicit assumption in the self-administered adjustment inventory or
similar device is that an individual knows his own state better than any outside
observer. Looking at it this way, we must give the child the benefit of the doubt and
assume that his is the clearer perception. Furthermore, Rynerson found a positive
correlation between the child’s perception of the severity of his symptoms based on
the CMAS, and estimates of severity by the examining psychiatrist in another group
of patients from the same clinic.


³The A-Scale items were:
7. At times (I) (my child) feels like shouting. (65% agreement)
8. At times (I) (my child) wish (I) (he) could be very far from here. (65%)
13. (I) (he) feels alone even when there are people around (me) (him). (73%)
15. (I) (he) get nervous when things do not go the right way for (me) (him). (63%)
19. Often (I) (he) have trouble getting (my) (his) breath. (64%)
23. (I) (he) have to go to the toilet more than most people. (67%)
26. (I) (he) have trouble swallowing. (72%)
37. (I) (he) often get lonesome when (I) (he) am with people. (69%)
39. (I) (he) am afraid of the dark. (63%)
40. It is hard for (me) (him) to keep (my) (his) mind on (my) (his) school work. (62%)

The L-Scale items were:
10. (I) (he) would rather win than lose in a game. (63%)
30. (I) (he) am always good. (70%)
36. (I) (he) tells the truth every single time. (65%)

Examination of the protocols suggested two possible explanations for the lack of 
correspondence of pairs of scores. It is reasonable to suppose that the symptoms of
children who are labelled as acting-out, i.e., those whose pathological behavior is
more manifest, will be better known to the parents than those of children who do not
act out. In our sample, there were 15 children labelled as acting-out problem cases
by the psychiatrist. The mean percentage of agreement between mothers and children
in this group is 59.01 which does not differ significantly from the percentage for
the remaining 58 cases (a median test chi-square is 0.43). We find, however, that the
parents of five of these 15 children had better than chance agreement with the child. 
This is 33.3% as opposed to 11.9% for the remaining 58 cases and 16.4% for the 
total N of 73. The difference in the number of cases in which there is a greater than 
chance agreement between the acting-out cases and the other cases is significant, 
yielding a probability of .05 by Fisher's exact method. We must therefore consider 
that one possible explanation of the lack of agreement on symptoms is simply that 
the parents are unaware of many of the symptoms because they are not manifest. 

A second possibility was that the lack of agreement was a function of excessive 
guessing concerning the child’s symptoms by the parent. This hypothesis was tested 
in the following manner. There were 14 mothers who omitted seven items or more 
and 37 mothers who did not omit any items. The average percentage of agreement 
with the child’s responses for the former group was 61.28 and 55.63 for the latter 
group. The difference is not significant when tested parametrically (t = 1.68) but is 
significant with a nonparametric test (a median test chi-square is 5.87, p < .05). 
Further scrutiny of the protocols indicated that the discrepancy between the para- 
metric and nonparametric tests was a function of a single, unusually low degree of 
agreement (26.2%) in the group of mothers who had omitted seven or more items. 
If this score is eliminated, the mean agreement for that group rises to 63.98%, the 
pooled variance is reduced by 15%, and the t is now 2.63, p < .05. We conclude that
this is another possible explanation of the lack of agreement between the responses 
and scores of mothers and children. 


SUMMARY 

The purpose of this study was to attempt to assess the perception of the symp- 
toms of an emotionally disturbed child by the mother. The CMAS was administered 
to 73 clinic patients and also filled out for them by their mothers. There was no 
difference between mean scores for mothers and children, and both were higher than 
means obtained from normal samples. There was, however, no correlation between 
pairs of mother-child scores, and item-by-item agreement between pairs was not 
greater than chance. Some evidence was presented which suggests that mothers may 
be unaware of their children’s symptoms.

STANDARDIZATION OF THE CHILDREN’S PICTURE
INFORMATION TEST¹


KATE L. KOGAN AND RICHARD L. CRAGER 


Department of Psychiatry, University of Washington 
University of Washington School of Medicine 


PROBLEM 

The need for a relatively rapid technique of assessing intellectual development
in young children with serious motor handicaps had led to the development of the
Children’s Picture Information Test (CPIT).² The test was designed for use with the
population of the Spastic Children’s Clinic and Preschool in Seattle, Washington.
In a preliminary study of 50 young cerebral palsied children the correlation between
CPIT and standard psychometric measures was .82. These findings made it apparent
that further refinement of the instrument and its standardization on a non-handicapped
population was warranted.


METHOD 

The Test. Several minor modifications were made in the test material before the 
standardization procedures were initiated. The original form had consisted of 34 sets 
of items; six additional sets of items brought the total to 40. Originally the several 
pictures on a single page were mounted on different colors; in the standardization 
form all pictures on a single page were mounted on the same color, so that color pre- 
ference could not possibly influence the child’s selection of a response. 

The method employs a multiple choice task requiring the child to indicate
the one picture (of four possibilities) which belongs with each key picture. An effort
was made to restrict the content largely to everyday household situations so that the
immobilized child of confined environment would have had a chance for familiarity
with the majority of the objects pictured. The sets of pictures are mounted on large
pages in book form. Colored pictures were selected from picture dictionaries and
other object picture books designed for young children.


SUBJECTS³

Standardization. The experimental design called for administering the test to 50 
children in each of eight six-month age intervals between 2 years 0 months and 5 
years 11 months, making a total of 400 children. It was recognized that there was no 
easy source of a heterogeneous population sample at this age level, comparable, for 
example, to using classroom groups of older children. Therefore, as partial insurance 
against biased selection, the children in each age group were classified according to 
their parents’ occupations so that the distribution would conform to the distribution 
of occupations according to 1950 US census figures. Attempt was also made to 
balance the groups as evenly as possible for numbers of boys and girls, and for age in 
months within the six month range. 

Since the majority of subjects were made available through various kinds of
nursery school groups, it was necessary to examine all children in any given group
who expressed a desire to participate. In this way a reservoir of data on 568 children
was accumulated. Within each age group the children’s records were arranged in the
chronological order in which they were tested; the first fifty children who best fitted
the ideal distributions of sex, age in months, and parents’ occupation were selected
for the standardization population. The characteristics of the subjects included in
the standardization are detailed in Table A of the Appendix.⁴ The occupational
distributions obtained in the various age subgroups were not significantly different
from the model.

¹This study was initially supported by a grant from the Dean’s Cerebral Palsy Fund, University
of Washington School of Medicine, and was completed under a research grant #B-1611 from the
National Institute of Neurological Diseases and Blindness, Public Health Service.

²The CPIT test material is assembled by hand by the Spastic Aid Council, 1850 Boyer Avenue,
Seattle, Washington. It will be provided to professional workers at a nominal charge to cover costs of
assembling and postage.

³The investigators are indebted to the Seattle Day Nursery Association, the Family Life Division
of the Seattle Public Schools, and the W. C. Bryant Elementary School for generously making their
extensive facilities available; the YWCA Nursery, University Preschool and Kindergarten, Carousel
Nursery School, Southtown Preschool, Gingerbread House, and Lockhaven Nursery also gave invaluable
cooperation.


Validity Group: Correlation with Stanford Binet. A group of 50 subjects was 
selected from the total reservoir available so that their distribution of ages from 2-0 
to 6-0 was balanced and so that the occupation of the parents approximated the 
census groupings. Each subject was given the Revised Stanford Binet, Form L, in 
addition to the CPIT. For practical reasons it was necessary to use only children 
whose parents could make them available at a time other than a regular nursery 
school session, for example, at their own home by appointment, or at a neighbor’s 
home. This was, then, more strictly a volunteer population than the larger standard- 
ization population, and subject to the distortions inherent in sampling volunteers. 
The mean IQ for the group was 109.1 with a range from 83 to 142, and a standard 
deviation of 13.7. The detailed composition of the group is given in Table B of the 
Appendix. A Chi-square test for both occupational deviation from the US census 
model, and for equal distribution of ages throughout the range yielded no significant 
deviations. The sex distribution was somewhat less than adequate, with 35 males and 
15 females in the group. 


Handicapped Group: Correlation with Stanford Binet. The test material had been
supplied on an experimental basis to a number of other professional workers in this 
country and abroad for experimental use in clinical situations. The senior author 
had also continued to use the instrument with the clinic population at the Spastic 
Children’s Clinic and Preschool in Seattle, Washington. From these sources the 
records of 59 children were selected to whom both the Revised Stanford Binet and 
the CPIT had been successfully administered. 


Test-Retest Reliability Group. A group of 50 subjects was selected from the total
reservoir available, in the same manner as described above for the validity group.
Each subject was given a second CPIT at an interval of 1-3 weeks from the date of 
his first test. Occupational and age distributions were not significantly deviant from 
the model. There were 26 males and 24 females. The group statistics are given in 
Table C of the Appendix. 


RESULTS 
Scores by Age Intervals. The total raw score (number of correct responses) for
each subject was tabulated, and the data analyzed by age-interval groups. The findings
are summarized in Table 1. Observation indicated that both mean and median
scores increased from each age group to the adjacent higher age interval. To test for
significance the Median Test appeared to be the measure of choice for our purposes
because Bartlett’s Test showed heterogeneity of variance was present; also raw
score distributions were skewed, shifting from a marked positive skew at early age
levels to a marked negative skew at upper age levels where the ceiling of the test was
reached.


TABLE 1. DISTRIBUTIONS OF SCORES BY AGE INTERVALS
(Columns: Age group, Median, Mean, Standard Deviation. Rows: the eight six-month
age groups from 2-0 to 2-6 through 5-6 to 6-0.)


⁴An Appendix giving more detailed data about subjects and results has been deposited with the
American Documentation Institute. Order Document No. 5914, remitting $1.25 for 35-mm. microfilm,
or $1.25 for 6 x 8 inch photocopies.

Each median was tested for significance against that of the next higher age
interval. All tests were significant (p < .01) except the comparisons of the two
lowest age groups and the two highest age groups. The results indicate, therefore,
that the total raw score discriminates adjacent six-month age interval groups between
the ages of 2-6 and 5-6.
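
A median test between two adjacent age groups can be sketched as below. This is an illustration under stated assumptions: it uses the common form of the test (counts above versus not above the pooled median, chi-square with one degree of freedom, no continuity correction), which may differ in detail from the procedure the authors followed, and the scores are hypothetical.

```python
import math
from statistics import median


def median_test(group_a, group_b):
    """Median test for two groups; returns (chi-square, p) with 1 df, uncorrected."""
    pooled_median = median(list(group_a) + list(group_b))
    a_above = sum(x > pooled_median for x in group_a)
    b_above = sum(x > pooled_median for x in group_b)
    a_not = len(group_a) - a_above
    b_not = len(group_b) - b_above
    n = len(group_a) + len(group_b)
    denominator = (
        (a_above + a_not) * (b_above + b_not) * (a_above + b_above) * (a_not + b_not)
    )
    if denominator == 0:
        raise ValueError("a margin of the 2 x 2 table is zero")
    chi_square = n * (a_above * b_not - b_above * a_not) ** 2 / denominator
    # Upper-tail probability of chi-square with one degree of freedom.
    p_value = math.erfc(math.sqrt(chi_square / 2))
    return chi_square, p_value


# Hypothetical raw scores for a younger and an older group (not the study's data).
younger = [1, 2, 3, 4, 5]
older = [6, 7, 8, 9, 10]
chi_square, p_value = median_test(younger, older)
print(f"chi-square = {chi_square:.2f}, p = {p_value:.4f}")
```

Because the test depends only on how many scores fall above the pooled median, it is not disturbed by the unequal variances and skewed distributions noted above.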

When the age interval groups were subdivided according to sex, it was noted 
that females tended to score slightly higher than males at each age range, with one 
exception. (See Table D of the Appendix.) However, the correlations between sex
and median score were not significant except in the highest age group, 5-6 to 6-0,
where the Phi coefficient reached .36.

Age interval groups were also subdivided into two categories according to 
parents’ occupation, in order to compare the effects of socio-economic status. One 
group included the occupations listed under professional, managerial, clerical, and 
sales categories and the other group combined the occupations listed under crafts- 
men, operatives, service, and labor. There was a slight tendency for the children of 
higher socio-economic status to obtain higher scores, but the correlation between 
parents’ occupation and CPIT score was significant only for one of the eight age-interval
groups. (Age range 2-6 to 3-0, Phi coefficient .28.) It appeared, from these
results, that the CPIT was, on the whole, not significantly related either to parents’
occupation or sex of subject within the age range studied.
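
The Phi coefficients cited above summarize the association in a 2 x 2 table, here a grouping variable (sex or occupational category) against scoring above or below the median. A minimal sketch follows; the cell counts are hypothetical, chosen only to show a table of 50 children that would yield a coefficient of .36, and do not come from the Appendix.

```python
import math


def phi_coefficient(a, b, c, d):
    """Phi for the 2 x 2 table [[a, b], [c, d]]: (ad - bc) / sqrt of the margin product."""
    denominator = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denominator == 0:
        raise ValueError("a margin of the 2 x 2 table is zero")
    return (a * d - b * c) / denominator


# Hypothetical counts. Rows: girls, boys. Columns: above median, at or below median.
phi = phi_coefficient(17, 8, 8, 17)
print(round(phi, 2))  # 0.36 for these illustrative counts
```

For a 2 x 2 table, chi-square equals N times Phi squared, so a Phi coefficient can be tested for significance against the chi-square distribution with one degree of freedom.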

CPIT and Revised Stanford Binet. The CPIT scores of fifty subjects were correlated
with their mental age ratings on the Revised Stanford Binet, Form L. A
scatter plot justified the use of the product-moment correlation which was .89 ± .03.
The CPIT scores of 59 handicapped children were similarly correlated with their
mental age ratings. The obtained correlation was .80 ± .05.
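
The text does not say how the ± figures attached to these correlations were obtained. As a hedged check, the sketch below assumes the classical large-sample standard error of a product-moment correlation, (1 − r²)/√N; this is an assumption, although it happens to reproduce the reported values to two decimals.

```python
import math


def standard_error_of_r(r, n):
    """Classical large-sample standard error of a correlation (assumed formula)."""
    return (1 - r ** 2) / math.sqrt(n)


# Correlations and sample sizes as reported in the text.
reported = [
    ("non-handicapped validity group", 0.89, 50),
    ("handicapped validity group", 0.80, 59),
    ("test-retest reliability group", 0.93, 50),
]
for label, r, n in reported:
    print(f"{label}: r = {r:.2f}, SE = {standard_error_of_r(r, n):.2f}")
```

Under this assumption the three standard errors round to .03, .05, and .02, matching the figures given for the two validity correlations and for the test-retest correlation of .93.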


Item Analysis and Weighting. The preliminary research had found that assign- 
ing weighted scores to the CPIT items improved the correlation with Stanford Binet 
scores for the group of handicapped children under investigation. There had been, 
at that time, no opportunity for cross-validating these weights in a replication 
study. The present research furnished data whereby the stability of the original 
weights could be tested for the two validity groups (59 handicapped, 50 non-handicapped).
In neither group did the weighted scores alter the correlation between
CPIT and mental age. 

Several other weighting schemes were explored. One was based on the percentage
passing each item in each of the eight age groups of the 400 standardization subjects.
In another approach, using the same data, Phi coefficients were calculated
correlating the passing of each item with total CPIT score for each of the age interval
groups. On the basis of the data from the non-handicapped validity group the passing
of each item was correlated with Binet mental age. (See Tables E and F of the
Appendix for these data.) A number of different weighting systems were constructed
using the above information in various combinations. None of them significantly
altered the correlations of .89 and .80 obtained between unweighted CPIT scores and
Binet mental age.


Reliability. The scores of the fifty subjects to whom two CPITs were administered
were arranged for test-retest correlation. The product-moment correlation
of the two sets of scores was .93 ± .02.

Since the test-retest reliability coefficient might be considered inflated due to
the extended age range used, the test was split into odd- and even-numbered items;
split-half reliability coefficients were calculated for one-year age interval groups, four
groups, 100 subjects in each. The four product-moment reliability coefficients, corrected
according to the Spearman-Brown formula, ranged from .85 to .90.
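
The Spearman-Brown correction mentioned above estimates the reliability of the full-length test from the correlation between its two halves. A minimal sketch follows; the half-test correlation used in the example is hypothetical, since the uncorrected values are not reported.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Predicted reliability when a test is lengthened by `length_factor`."""
    return length_factor * r_half / (1 + (length_factor - 1) * r_half)


# Hypothetical odd-even correlation for one age group (not reported in the text).
r_odd_even = 0.78
print(round(spearman_brown(r_odd_even), 2))
```

For a split-half design the factor is 2, so the corrected coefficient is 2r/(1 + r) and always exceeds the uncorrected half-test correlation when r lies between 0 and 1.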


Tables of Norms. The skewed nature of the distributions of raw scores for each
age group made normalizing mandatory for most efficient test use in practical clinical
situations. Normalized standard scores (T scores) were therefore calculated. Table
2 lists the CPIT raw scores and T score equivalents for the 8 six-month age interval
groups. Table 3 lists T score equivalents for the raw scores of one-year age interval
groups. In each of these T score tables a score of 50 represents the mean and the
standard deviation is 10. Table 4 permits the translation of T scores into centile
scores, which may be more familiar to some professional workers.


TABLE 2. CPIT RAW SCORES AND T SCORE EQUIVALENTS FOR SIX-MONTH AGE INTERVAL GROUPS

Age groups: 2-0 to 2-6, 2-6 to 3-0, 3-0 to 3-6, 3-6 to 4-0, 4-0 to 4-6, 4-6 to 5-0, 5-0 to 5-6, 5-6 to 6-0
Columns: CPIT raw score; T score for each age group
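
The relation between normalized T scores and centiles described above follows from the normal curve. The following is a minimal sketch in Python, assuming only that T scores have a mean of 50 and a standard deviation of 10 as stated; the function names are illustrative, and the values it produces need not agree with the printed table to the last digit, since the table may have been rounded.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist(mu=0.0, sigma=1.0)


def t_to_centile(t_score):
    """Centile rank of a normalized T score (mean 50, standard deviation 10)."""
    z = (t_score - 50.0) / 10.0
    return 100.0 * STANDARD_NORMAL.cdf(z)


def centile_to_t(centile):
    """Normalized T score corresponding to a centile rank strictly between 0 and 100."""
    if not 0.0 < centile < 100.0:
        raise ValueError("centile must lie strictly between 0 and 100")
    return 50.0 + 10.0 * STANDARD_NORMAL.inv_cdf(centile / 100.0)
```

The second function is the step used in normalizing: the centile rank of a raw score within its age group is converted to the T score occupying the same position on the normal curve.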

Interpretation of the results with respect to a child older than 6 years whose test
score falls within the ranges obtained here is less precise. Presumably the test is ap-
propriate for children over the age of six who are sufficiently handicapped that they
cannot participate in a more formal psychometric examination, and who are mentally
retarded to some degree. In the absence of direct mental age equivalents for CPIT
scores, at this stage of test development, the data in Table 5 permit one to compare
the score of a child of any age with the obtained age-development patterns of the
standardization group.


TABLE 3. CPIT RAW SCORES AND T SCORE EQUIVALENTS FOR ONE-YEAR AGE INTERVAL GROUPS

Age groups: 2-0 to 3-0, 3-0 to 4-0, 4-0 to 5-0, 5-0 to 6-0
Columns: CPIT raw score; T score for each age group


DISCUSSION 

The data reported in this study establish the fact that the ability to make cor-
rect responses to the CPIT items increases with age between 2 and 6, and that the
test is most effective between 2½ and 5½ years of age. Exactly what skills contri-
bute to the ability to make correct responses cannot be defined. As the name of the
test implies, it appears to measure the amount of information the child has assimil-
ated from his environment. It has been demonstrated that the ability to apply this
information to selecting the correct response is closely related to intelligence as
measured by the Revised Stanford Binet. Undoubtedly correct response involves
the capacity to select critically, and to inhibit immature naming responses, that is, to
hold the test demands in mind, as well as possessing the required factual information;
attention and concentration also play a role. In the light of these considerations it is
safe to assume that whatever the CPIT measures it corresponds with one or more
facets of intelligence. CPIT score may, then, be a way of estimating intelligence in
much the same way that vocabulary score is often accepted as the best single pre-
diction of total score on the more complex psychometric instruments. Under no
circumstances would this kind of measure be preferred if more extensive and varied
examination is possible and available. In the setting in which the instrument was
devised, for use with severely handicapped young children whose capacities could
not be measured effectively by conventional means, the CPIT appeared to provide
valid information which might not otherwise be available. It is anticipated, also,
that the test may prove to be useful more generally by supplying confirmatory
evidence in the evaluation of many preschool aged children. Since this research was
initiated, Haeusserman’s use of the structured interview in assessing the develop-
mental potential of young handicapped children has been published (3). Her very
painstaking approach can provide a kind of detailed information not inherent in the
CPIT. The two methods may well complement each other in clinical use according
to the practical demands of a given situation.

TABLE 4. FOR TRANSLATING T SCORES INTO CENTILES

Columns: T score; Centile

TABLE 5. PERCENTAGE OF EACH AGE GROUP WHO EXCEED CERTAIN CPIT TOTAL SCORES

Columns: CPIT raw score; percentage of each age group exceeding that score

It appears from the data obtained in this standardization study that the accum-
ulation of a larger cross-validation population having both CPIT and conventional
mental age scores might lead to the establishment of mental age equivalents for 
CPIT scores. There are, however, certain variables which would have to be fully 
investigated. It is possible that such equivalence might prove to be different for 
different age groups. It is also possible that equivalence between CPIT and mental 
age ratings might prove to be different for handicapped and non-handicapped 
groups, and might even be different for different kinds of handicapped populations, 
for example mentally retarded children and children with motor disabilities. Finally, 
as our philosophy of assessment and evaluation in this field continues to grow and 
develop, we may find that we are willing to dispense with the concept of a mental 
age score as making little real contribution to our understanding. 


SUMMARY 

The CPIT was administered to 400 children equally divided into eight six- 
month age interval groups between the ages of 2 and 6. The groups were selected so 
that they approached as closely as possible an optimal distribution of sex, age in 
months, and parents’ occupation. Both mean and median total scores increased from 
each age group to the adjacent higher age interval. The discrimination between 
adjacent age interval groups on the basis of median score was statistically significant 
between the ages of 2½ and 5½. On the whole CPIT was not significantly related
either to parents’ occupation or sex within the age range studied.

For fifty subjects selected from the standardization population the product-
moment correlation between CPIT and Revised Stanford Binet was .89; for 59
handicapped children from a variety of sources the correlation between CPIT and
Revised Stanford Binet was .80.

Test-retest reliability for a group of fifty subjects was .93; split-half reliability
for one year age interval groups ranged from .85 to .90.

Normalized standard scores were calculated for both six month and one year 
age interval groups. 


REFERENCES 

1. EDWARDS, A. L. Experimental design in psychological research. New York: Rinehart, 1950.

2. GUILFORD, J. P. Fundamental statistics in psychology and education. New York: McGraw-Hill,
1950.

3. HAEUSSERMAN, ELSE. Development potential of preschool children. New York: Grune and Stratton,
1958.

4. JURGENSEN, C. E. Table for determining Phi coefficients. Psychometrika, 1947, 12, 17-29.

5. KOGAN, KATE L. A method of assessing capacity in preschool cerebral palsied children. J. clin.
Psychol., 1957, 13, 54-56.

6. SIEGEL, S. Non-parametric statistics for the behavioral sciences. New York: McGraw-Hill, 1956.





THE PAIN APPERCEPTION TEST: AN APPLICATION TO 
SEX DIFFERENCES 
DONALD V. PETROVICH¹


Veterans Administration Hospital, 
Jefferson Barracks, Missouri 


PROBLEM 

The experimental literature reporting sex differences for pain thresholds and 
experiences is concerned primarily with traditional psychophysical measurements. 
While there is tacit recognition that apperceptive factors are important in the 
experience of pain, little has been done to bring in this potentially rich background 
for more comprehensive studies. In order to initiate assessments of this apperceptive 
background, the Pain Apperception Test was developed. Thus far, its application
has been directed mainly to demonstrating its efficacy as a research tool through
assessing its relationship to some indicators of anxiety and neuroticism, and by
examining the elements of anticipatory pain responses.

Experimental literature on sex differences and pain is meager and inconclusive. 
Thus Chapman and Jones, using heat radiation with 200 normal subjects, found
that the average male had slightly higher pain perception and pain reaction thres-
holds than the average female, with the differences not being significant. Kennard,
using electrical stimulation, found the pain perception threshold higher in males in
both patient and control groups. Hall and Stride observed higher heat-pain re-
action thresholds in males among psychiatric hospital admissions. On the other
hand, Stengel, Oldham, and Ehrenberg, using physical pain stimuli such as
pricking, pressure, and probing, reported “a small tendency for men to react more
than women.”

While these studies generally have been of the traditional psychophysical 
variety, it is proposed that eventually more complete studies should include not 
only the psychophysical measurements, but also an assessment of the predispositions 
that tend to determine—perhaps within wide limits—variations of these thresholds. 
The purpose of the study reported here was to enter the obvious but inadequately
studied area of sex differences in the predispositions to experience pain. 


METHOD

Subjects were 100 professional and clerical personnel (50 male and 50 female) 
of Jefferson Barracks Veterans Administration Hospital. Mean age and education, 
respectively, were 39.14 years and 15.32 grades for the men; and 40.18 years and 
13.36 grades for the women. 

The Pain Apperception Test (PAT) was used to assess attitude toward pain as 
reflected in judgments by each subject of the intensity and duration (each on a 7- 
point, multiple-choice scale) of the following 17 pictures: 1, bumped head; 2, cut 
forearm; 3, hit shin; 4, stomach ache; 5, smashed thumb; 6, electric shock; 7, head- 
ache; 8, stubbed toe; 9, broken leg; 10, fall on back off ladder; 11, ice dropped on 
foot; 12, deltoid hypodermic; 13, tooth drilling; 14, shoulder gunshot; 15, burned 
hand; 16, fish hook in neck; and 17, removing foreign substance from eye. Each 
picture was responded to for the questions “How does the man feel?” (intensity
dimension) and “How long will it hurt him?” (duration dimension). The eight 
counterpart pictures of the PAT (four Anticipation and four Other-Inflicted) were 
not used in this study. Additional information regarding rationale, development, and 
use of the Pain Apperception Test is available in previous publications. Sub-
jects were individually given the PAT, usually at their work station, with time per
subject averaging approximately 15 minutes. Analysis of results was done accord-
ing to statistical techniques as advocated by Edwards and by McNemar.

¹The writer is indebted to Dr. Rex M. Collier, Chief, Psychology Service, for active assistance in
the literary presentation; and to those personnel of Jefferson Barracks Veterans Administration 
Hospital who acted as subjects.


RESULTS

Split-half reliability of PAT scores was .60 for Intensity and .89 for Duration 
for the 50 males; and .70 for Intensity and .84 for Duration for the 50 females. Mean 
difference between sexes (N = 50 each) was significant for the education variable; 
age difference was not. There were comparatively small and non-significant mean 
differences between sexes in both age and education for the professional and for the 
clerical worker sub-groups. Results of two-tailed t-tests comparing the sexes as to 
mean PAT scores, for all subjects and for the component professional and clerical 
sub-groups, are presented in Table 1. 

TABLE 1. PAIN APPERCEPTION TEST SCORES: SEVENTEEN PICTURE TOTALS, COMPARISON OF SEXES

Columns: Subjects; Male; Female
Rows: Intensity (All Ss, Professional, Clerical); Duration (All Ss, Professional, Clerical)

*Significant between .05 and .01 levels.


Table 1 reveals that in all cases the females rated both the intensity and the 
duration of the pictured pain situations as higher than did the males. The difference
was significant (P between .05 and .01) for Duration when entire groups were com- 
pared, and there was a marked tendency for the females to ‘‘see’’ more pain Intensity 
in the pictures than the males. Sex differences between clerical worker sub-groups 
were small whereas differences between the two professional worker sub-groups ap- 
proach statistical significance (p between .10 and .05) for both Intensity and 
Duration. 

Comparison of sexes as to mean scores on individual pictures revealed that the 
females rated 10 of the 17 pictures as signifying greater Intensity, and 14 of the 17 
pictures as signifying greater Duration, than did the males. Significant differences 
(critical ratio) between the 50 males and 50 females in judging specific PAT pictures 
are shown in Table 2. 

TABLE 2. SIGNIFICANT SEX DIFFERENCES IN EVALUATION OF PAIN APPERCEPTION TEST PICTURES
(N = 50 MALES, 50 FEMALES)

Picture   Pain Dimension   Male   Female
3         Intensity
6         Duration
10        Intensity
10        Duration
11        Intensity

*Significant at .05 level.
**Significant at .01 level.


The five significant differences concern four pictures (No. 3, shin hit by baseball; 6, 
electric shock; 10, fall on back from ladder; 11, ice dropped on foot) of varying de-
grees of intensity and/or duration as indicated by the mean scores. In each instance
the females scored higher. 










DISCUSSION 

Because of disproportionate representation of the sexes in certain jobs, and the 
added factors of time and availability, the subjects were fairly heterogeneous as to 
occupational classification. The sub-classification into professional and clerical 
workers was undertaken better to control the possible importance of occupational 
identity in regard to pain attitudes. The resulting strong tendency among pro- 
fessionals for females to judge the pictures as more painful than did the males, in 
conjunction with the negligible sex differences between clerical worker sub-groups, 
presents the possibility of significant occupational biases. The professional category 
included psychologists, physicians, social workers, rehabilitation therapists, nurses, 
and others, and these were not equally represented as to sex. 

Evaluation of results is also complicated by the fact that all PAT pictures depict 
males. The importance of this in influencing and possibly producing sex differences 
in empathy and projection is problematical. However, the significantly greater mean 
Duration total response to all pictures by the entire female group (Table 1), along 
with their significantly greater Intensity and/or Duration response to a specific 4 of
the 17 pictures (Table 2), constitutes some evidence that real sex differences in pain 
apperception and attitudes may exist. 

Results of this exploratory study do suggest a measure of agreement with the 
psychophysical studies cited above, wherein higher pain perception and/or pain 
reaction thresholds were generally reported for the males. In the present experiment, 
the occupational heterogeneity of subjects precluded more definitive statistical an- 
alysis, and further experimentation along these lines should better control for this 
and other relevant variables. Present results also indicate that the assessment of 
occupational biases in pain attitudes may well prove interesting and fruitful. 


SUMMARY AND CONCLUSIONS 

This study was designed to initiate a new method of assessing sex differences in 
pain reactions. As contrasted with the traditional psychophysical method, the re- 
cently developed Pain Apperception Test was used to assess sex differences in pre- 
dispositions to experience pain. Intensity and Duration judgments of 17 pictures 
of the PAT were obtained from 50 male and 50 female hospital personnel. Split-half 
reliability of this essentially projective test was acceptably high, ranging from .60 to 
.89. Obtained results indicated that females were generally predisposed to see more 
pain—both Intensity and Duration—in the PAT pictures than were the males; with 
this difference being significant for total Duration scores, and for a specific 4 of the 
17 pictures. Similar sex differences were present but less apparent (strong tendencies) 
when professional worker sub-groups were compared, and negligible differences were 
found in comparing clerical workers. The depiction of males in all the PAT pictures 
presented a complicating factor, and the possible influence and importance of oc-
cupational biases in pain attitudes was noted.


AN ANALYSIS OF GROSS ERRORS ON A
SENTENCE COMPLETION TEST
WILSON H. GUERTIN


Veterans Administration Hospital 
Knoxville, Iowa 


PROBLEM 
Sentence completion type tests provide such interesting material for content 
analysis that errors, per se, are given little attention in the final interpretation. The 
psychiatric patients at our hospital produce many gross errors that we would like to 
understand better. It was hoped that an analysis of the intercorrelations of these 
errors made by our patients might reveal some consistent dysfunctions that might be 
identifiable and of diagnostic significance. 


PROCEDURE 

One hundred Rotter Incomplete Sentence Tests, containing at least five
errors each, were selected from the files of our psychiatric hospital patients. To 
intercorrelate errors it was necessary for the investigator to assign them semi- 
objectively into categories. The errors were studied, categorized and, finally, after 
much trial and error, some fairly reliable categories emerged as seen in Table 1. The 
frequency of occurrence of a given category of errors was to be intercorrelated with 
each other category so cutting lines were established and tetrachoric intercorrelations 
were calculated. The matrix was factored by Thurstone’s complete centroid method 
and the resultant matrix was rotated to oblique simple structure. 
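
The step from dichotomized error frequencies to a tetrachoric coefficient can be sketched in a few lines. The example below, in Python, uses the cosine-pi approximation to the tetrachoric correlation; this is an assumption made for illustration, since the text does not state how the coefficients were obtained, and the counts shown are hypothetical.

```python
import math


def tetrachoric_cos_pi(a, b, c, d):
    """Cosine-pi approximation to the tetrachoric r for a fourfold table.

    a = both error categories above their cutting lines,
    b, c = above on one category only,
    d = both below. All counts are hypothetical in this sketch.
    """
    if min(a, b, c, d) < 0:
        raise ValueError("cell counts must be non-negative")
    if a * d == 0 and b * c == 0:
        raise ValueError("coefficient is undefined for this table")
    if b * c == 0:
        return 1.0
    odds_ratio = (a * d) / (b * c)
    return math.cos(math.pi / (1.0 + math.sqrt(odds_ratio)))


# Hypothetical table for two error categories across 100 protocols.
example_r = tetrachoric_cos_pi(40, 10, 10, 40)
```

With equal cross-products the approximation returns zero, and it approaches plus or minus one as one diagonal of the table empties. A matrix of such coefficients, one per pair of error categories, is the kind of input the factoring described above requires.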


RESULTS AND DISCUSSION

Table 1 shows the oblique factor matrix with a brief designation of the nature 
of the error-item. The four obtained factors accounted for about half of the total 
variance of the error-items, suggesting that errors are not just random, unsystem- 
atic occurrences but may be of diagnostic significance. Correlations between factors 
are all less than .20 except for —.25 between the first and second, and .43 between 
the first and third. While the first three factors represent tendencies to commit un- 
desirable grammatic and syntactic errors, the fourth factor seems to represent the 
constructive tendency to express oneself economically and effectively.

TABLE 1. OBLIQUE FACTOR LOADINGS FOR ERROR-ITEMS

Factors (columns): Word Carelessness; Irrelevance; Poor Spelling; Telegraphic Phrasing

Error-items (rows): Simple word used wrongly; Gross malapropism; Extra word inserted; Simple word substitution; Answers with phrase; Disregards printed structuring; Repeats word perseveratively; Total irrelevance; Final “s” omitted; Letter substitution; Error in spelling; Uncrossed “t”; Omission (other); Tense wrong; Omits article; Omits verb; Omits conjunction

Table 2 presents factor loadings on some items, entirely unrelated to errors,
that were introduced to clarify the nature of the obtained factors. They did not
enter the actual factor analytic determinations. The first six reference items were
based exclusively upon ratings of the nature of all the sentence completion responses.
The items were selected a priori from many possible ratings of content in the hope
that they would prove most relevant to the area under investigation. The seventh
and eighth items were based upon word and item counts, respectively.


TABLE 2. EXTENDED FACTOR LOADINGS FOR REFERENCE ITEMS

Reference item             Word           Irrelevance   Poor       Telegraphic
                           Carelessness                 Spelling   Phrasing

Delusions                   18             24            ..         55
Abstractions                30            -23            23         50
Complaints                  ..            -21            ..         20
Denials                     55             34            ..         18
Perseveration of ideas      ..            -10            ..         05
Platitudes                  ..            -25            ..        -21
Productive                  ..             ..            ..         ..
Skips items                 ..             ..            ..         ..

IQ (Shipley)                ..             ..            ..         ..
DQ (Shipley)                30            -15            12        -53
Education                   16            -09            16         34
CA                         -05             21            18         09
Paranoid schiz.             09             05            10         11
Organic B.D.               -24             03            06        -32
Chronic undiffer. schiz.   -06             09           -01        -02

(.. = entry not legible)





Table 2 reveals the strong influence of a group of schizophrenics who are de-
lusional, concerned with abstractions, and show a better preservation of verbal than
conceptualizing ability. Since verbal ability is retained, the Shipley IQ remains
higher than for those with a severe loss of verbal ability, thereby suggesting less
pathology. However, the preservation of verbal ability produces a lower DQ for 
these patients although they actually have suffered less deterioration. These sub- 
jects employ the relatively sophisticated style of expression labeled the Telegraphic 
Phrasing factor. 

Another cluster of correlations is encountered in Table 2 that implies the 
presence of individuals who complain excessively, do not deny their own problems, 
and have poorly preserved mental ability. They tend to perseverate with a single 
idea because of low ability and preoccupation with problems. These subjects are so 
self-focused that they fail to attend to what they write as seen by their heavy loading 
on the Word Carelessness factor. 


SUMMARY 
One hundred Sentence Completion Tests of mental hospital patients were ex- 
amined for gross errors. The errors were categorized and then intercorrelated. A 
factor analysis demonstrated that there was consistency in the nature of the errors 
made and that the type of error was related to other material of clinical significance. 
The four factors obtained were labeled: Word Carelessness, Irrelevance, Poor Spelling, 
and Telegraphic Phrasing.


DEVIANT RESPONSES AS INDICATORS OF IMMATURITY
AND NEUROTICISM¹
JOHN C. ROITZSCH AND IRWIN A. BERG 


Louisiana State University 


PROBLEM 


Various studies have demonstrated that the distribution of subjects’ responses
to relatively unstructured stimuli does not follow a chance distribution.
Typically, one finds percentages of 80-20 or some equally skewed distribu-
tion. Such biased response patterns may be reflections of personality character-
istics; however, attempts to use the bias or pile-up of responses as measures of
personality and other traits have met with only moderate success. But when
attention is turned from the subjects who exhibit the common or popular expression
of response bias to those whose responses run counter to the common bias, a definite
relationship to personality characteristics may be discerned. Such responses which
were significantly different statistically from the common response pattern have been
labelled deviant responses by Berg.

An examination of deviant responses as they occurred in various groups of 
subjects led to the formulation of the Deviation Hypothesis which has been stated 
as follows: ‘‘Deviant response patterns tend to be general; hence those deviant be- 
havior patterns which are significant for abnormality (atypicalness) and thus re- 
garded as symptoms (earmarks or signs) are associated with other deviant response 
patterns which are in noncritical areas of behavior and which are not regarded as 
symptoms of personality aberration (nor as indicators, signs, earmarks).’’ (p. 19)

A number of studies of the Deviation Hypothesis have used the
Perceptual Reaction Test (PRT) as a convenient instrument for eliciting deviant 
responses. This test consists of 60 abstract designs drawn with ruler and compass, 
each with four options: like much (LM), like slightly (LS), dislike slightly (DS), and 
dislike much (DM), of which the subject is required to choose one. About seven 
minutes are required for normal subjects to complete the test. 

Employing the PRT in the framework of the Deviation Hypothesis, Barnes
used deviant responses to develop reliable scales for measuring
general aberrant behavior, psychotic states, character disorders, and schizophrenic
reactions. Hesterly and Berg investigated immaturity of behavior in schizophrenia
using the PRT to determine whether the deviant response patterns of adult schizo-
phrenics would be similar to those of normal children. Their results indicated that
the response patterns of groups of normal, young children (ages 7, 9, 11) to the PRT
significantly differed from those of normal adults but were similar to those of adult
schizophrenics. They also reported that as normal children mature, their response
patterns gradually approach the patterns for normal adult responses on the PRT.

The question then arises, since the response patterns of groups of normal, young 
children (ages 7, 9, 11) more nearly approximate those of schizophrenic adults than 
those of normal adults, whether the patterns of groups of older children (ages 12-17)
approximate those of neurotic adults, who are usually characterized by somewhat 
more mature behavior than schizophrenic adults? Therefore, according to the De- 
viation Hypothesis, we should expect that immature and neurotic reactions, which 
represent deviant responses in critical areas of behavior, would be associated with 
deviant responses in noncritical areas of behavior, such as atypical option preferences 
on the PRT. On this basis it may be initially hypothesized that a significant number 
of the PRT responses made by a group of normal adolescents, when compared to a 
group of normal adults, will be deviant and will serve to differentiate the normal 
adolescents from the normal adults. Similarly, as a second hypothesis, groups of 
adult neurotics will be distinguishable from groups of normal adults on the basis of 
PRT deviant response frequency. However, since normal adolescents and neurotic
adults are regarded as behaviorally immature, a third hypothesis would state that
the adolescents and the neurotic adults should reveal a similar number of deviant
responses. Finally, it is hypothesized that normal adults and neurotic adults will
be significantly different statistically from adult schizophrenics in terms of deviant
response occurrence as measured by the Barnes schizophrenia scale.

¹An expanded version of this study is available from Irwin A. Berg in mimeographed form.


METHOD

Three hundred white school children (150 males and 150 females) and a group 
of 45 white neurotic adults (20 males, mean age 37.5 and 25 females, mean age 41.2) 
were used as subjects for the present study. The school children were divided into 
three age groups, 12-13; 14-15; and 16-17, with 50 males and 50 females in each 
group. These adolescent subjects were obtained from various public schools in
Louisiana, and all were normal in the sense of exhibiting no obvious signs of psycho- 
pathology. The adult neurotic subjects were drawn from psychiatric clinics in 
Louisiana and Texas and had received a staff diagnostic label of ‘‘psychoneurosis’’. 
The normal adult subjects were 500 white males and 350 white females drawn by 
Barnes from a norm group of more than 4,000 subjects in Berg’s files. The PRT
data for schizophrenic patients were also obtained from Barnes’s study. 

The PRT was administered to the children in their classroom and to the neurotic 
adults in the psychiatric clinics. The PRT responses for each subject were trans- 
ferred to IBM cards for scoring and tabulation. For every PRT item, the four option 
choices made by each of the three groups of adolescents were separately summed for 
male and female subjects. The same procedure was followed for the adult neurotic
and normal groups. These frequencies of option choices were then converted into 
percentages for each group with males and females tabulated separately. The per- 
centage of normal adult subjects choosing each option was then compared with the 
percentage of adolescent subjects in each of the three groups who chose the same 
option. The same comparison was made for the adult normal with adult neurotic 
groups and for the adult neurotics with each of the adolescent groups. Each com- 
parison required testing the level of statistical significance of the difference in the 
percentages for each option by use of chi-squares in a contingency table as described 
by Mainland and Murray. Since the PRT has 240 options and 14 comparisons
between the various groups were required, the level of statistical significance for 
3,360 options was determined in the course of the present study. Finally, to deter- 
mine whether the neurotic groups were different from schizophrenics, the PRT 
choices for normal and neurotic groups were scored on the Barnes PRT schizo-
phrenia scales (called Sigma Scales by Barnes) and the significance level of the mean
schizophrenia scores for each group was determined. 
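
The comparison of two groups on a single PRT option can be illustrated as follows. This is a minimal sketch in Python, assuming an ordinary Pearson chi-square on a fourfold table (chose or did not choose the option, by group) with one degree of freedom; the tables of Mainland and Murray used in the study may differ in detail, and the counts below are hypothetical.

```python
# Critical value of chi-square with 1 degree of freedom at the one per cent level.
CRITICAL_1_PERCENT = 6.635


def chi_square_2x2(chose_1, n_1, chose_2, n_2):
    """Pearson chi-square comparing the proportion choosing an option in two groups."""
    a, b = chose_1, n_1 - chose_1
    c, d = chose_2, n_2 - chose_2
    if min(a, b, c, d) < 0:
        raise ValueError("counts must not exceed group sizes or be negative")
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("chi-square is undefined when a marginal total is zero")
    total = n_1 + n_2
    return total * (a * d - b * c) ** 2 / denominator


# Hypothetical option: chosen by 30 of 100 in one group and 10 of 100 in the other.
example_chi = chi_square_2x2(30, 100, 10, 100)
is_deviant = example_chi > CRITICAL_1_PERCENT
```

An option whose chi-square exceeds the critical value would be counted as a deviant response for that pair of groups.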


RESULTS AND DISCUSSION

On the basis of chance alone, two or three of the 240 PRT options would occur 
as deviant responses when the one per cent level of statistical significance is used in 
comparing groups. It was found that this chance expectancy was vastly exceeded. 
From 74 to 144 of the 240 PRT options differentiated groups of normal adolescents 
from normal adults at the one per cent level of significance. 
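
The chance expectancy cited above is a matter of simple arithmetic, sketched here in Python. The constant and function names are illustrative; the only figures used are those given in the text (60 designs with four options each, 14 group comparisons, and the one per cent level).

```python
def expected_chance_hits(n_tests, alpha):
    """Expected number of tests reaching significance when no true difference exists."""
    return n_tests * alpha


OPTIONS_PER_COMPARISON = 60 * 4  # 60 designs, four options each
COMPARISONS = 14

per_comparison = expected_chance_hits(OPTIONS_PER_COMPARISON, 0.01)
total_tests = OPTIONS_PER_COMPARISON * COMPARISONS
```

The expectation of about 2.4 options per comparison does not depend on the options being independent of one another, so it holds even though the four options of a single design are mutually exclusive choices.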

When neurotic adults were compared to normal adults, 132 options for the neurotic
males and 120 options for the neurotic females were significantly different at the one
per cent level. The data in
Table 1 summarize the results of scoring the normal, neurotic, and schizophrenic
groups with Barnes’ PRT schizophrenia scale. It is evident that both male and 
female neurotics are significantly different from male and female schizophrenics. 
The neurotic females are also significantly different from normal females in schizo- 
phrenia score. The neurotic males, however, do not differ in this respect when com- 
pared to normal males. 

Thus, as hypothesized, normal adolescents and adult neurotics both differ sig-
nificantly from normal adults when compared on the basis of deviant responses to
the PRT. The normal adolescents and neurotic adults, however, have a quite similar
deviant response frequency. Hence, adult neurotics are immature in the sense that
their PRT responses are like those of 12 to 17 year-old children rather than like
those of normal adults. The deviant response pattern of neurotic adults does not
resemble that of schizophrenics, for the two groups are readily distinguished from
each other by the Barnes schizophrenia scale.

TABLE 1. MEAN PRT SCHIZOPHRENIA SCORES FOR GROUPS OF NORMAL, PSYCHONEUROTIC,
AND SCHIZOPHRENIC ADULTS

Groups (columns): Normals, Males (N = 500) and Females (N = 350); Neurotic, Males (N = 20) and Females (N = 25); Schizophrenic, Males (N = 99) and Females (N = 68)
Row: Mean Schizophrenia Score

a The data for normal adults and schizophrenic adults are from Barnes.
b Significantly different from the mean score of the schizophrenic group at the one per cent level.
c Significantly different from the mean score of the normal group at the one per cent level.


SUMMARY 
The Deviation Hypothesis holds that deviant responses are general; hence 
deviant responses in a critical area of behavior (i.e., neurotic symptoms) will be
associated with other deviant responses in noncritical areas of behavior (i.e., choice
of test options). As a test of one facet of this hypothesis, the Perceptual Reaction 
Test (PRT) was administered to 300 normal adolescents and 45 neurotic adults. 
Statistically significant differences in the frequency of deviant responses were found 
between groups of normal adolescents and normal adults. Significant differences 
were not found between groups of normal adolescents and neurotic adults, indicating 
that neurotic adults reveal PRT deviant response frequencies which are similar to 
those of normal adolescents. Thus, neurotic adults exhibit immature response pat- 
terns in the sense that they are quite similar to those of normal adolescents as 
measured by deviant response frequencies on the PRT. The deviant response pattern 
of neurotic adults was found to be significantly different from that of adult schizo- 
phrenic patients. The evidence from the present study indicates that groups of 
normal adolescents and groups of neurotic adults can both be differentiated from 
normal adults on the basis of deviant responses occurring on the PRT. Also, groups 
of normal and neurotic adults can be distinguished from adult schizophrenics on the 
basis of deviant response patterns. 
AN INTERNAL CONSISTENCY ANALYSIS OF THE
FORCED-CHOICE ANXIETY SCALE
JOHN A. HAMMES AND E. DELL YOUNG¹


University of Georgia 


PROBLEM 

Heineman)? has revised the true-false Taylor Anxiety Scale to a multiple-choice 
format that minimizes the response bias of true-false items. Hammock“? sub- 
sequently administered a revision of the Heineman scale to Army trainees, in the 
interest of relating A-scores to personnel selection. The resulting normative data 
were found to approximate Heineman’s findings to the degree that Hammock con- 
cluded “‘it is unlikely that the Army and college groups differ on manifest anxiety.” 
The present study consists of an internal consistency analysis of Hammock’s re- 
vision of the Heineman scale, and a comparison of the present sample with those of 
Heineman and Hammock. 


METHOD

Hammock’s scale was administered as an ‘‘attitude check list’’ to 302 students 
taking introductory psychology at the University of Georgia, and the upper and 
lower 27 per cent of the distribution were selected. For each scale item, there were 
computed the percentages of these two groups checking the item. Item validity 
coefficients were then obtained by entering these percentages in Flanagan’s table, 
which gives estimates of the product-moment correlation between the two underlying 
normally distributed variables. An item analysis against extreme groups was carried
out.
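
The extreme-groups procedure described above can be sketched in Python. This is only a minimal illustration under stated assumptions, not the authors' computation: the function names and the small response matrix are hypothetical, and a simple difference in proportions stands in for the correlation estimates that would be read from Flanagan's table, which is not reproduced here.

```python
def extreme_groups(total_scores, fraction=0.27):
    """Return index lists for the lowest and highest `fraction` of scorers."""
    k = max(1, round(len(total_scores) * fraction))
    order = sorted(range(len(total_scores)), key=lambda i: total_scores[i])
    return order[:k], order[-k:]


def percent_checking(responses, group, item):
    """Percentage of the people in `group` who checked `item` (coded 0 or 1)."""
    return 100.0 * sum(responses[i][item] for i in group) / len(group)


def item_analysis(responses, fraction=0.27):
    """For each item, compare upper-group and lower-group percentages."""
    totals = [sum(row) for row in responses]
    lower, upper = extreme_groups(totals, fraction)
    table = []
    for item in range(len(responses[0])):
        p_upper = percent_checking(responses, upper, item)
        p_lower = percent_checking(responses, lower, item)
        # Difference in proportions: a stand-in index, NOT Flanagan's coefficient.
        table.append((item, p_upper, p_lower, (p_upper - p_lower) / 100.0))
    return table


# Hypothetical data: 10 respondents by 3 items, 1 = item checked.
responses = [
    [1, 1, 1], [1, 1, 0], [1, 0, 1], [0, 1, 0], [1, 0, 0],
    [0, 0, 1], [0, 0, 0], [0, 1, 0], [0, 0, 0], [1, 1, 1],
]
for item, p_upper, p_lower, d in item_analysis(responses):
    print(f"item {item}: upper {p_upper:.0f}%, lower {p_lower:.0f}%, D = {d:+.2f}")
```

With 302 respondents, as in the present sample, each extreme group defined this way would contain 82 persons.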

The A-scale was administered with the following instructions: ‘‘In this question- 
naire you will read statements describing various attitudes and feelings. The state- 
ments are arranged in blocks of three. Read the three statements and pick out the 
one that describes your attitude or feeling most and mark it here (illustrate with 
sample items). Then, from the remaining two statements, choose the one that des- 
cribes your attitude least and mark it here (indicate). Then go on to the next block of 
three statements, and repeat the procedure. Remember, there are no right or wrong 
answers. Your score is your own, because everyone is different. Be sure to mark all 
blocks. Any questions?”’ 


RESULTS 
Item validity coefficients for the 50 anxiety items are presented in Table 1, in
an arbitrary grouping. There was one item which correlated negatively, namely,
-.12. The next higher item correlated .11, and the highest correlated .63. Forty-three
of the items were above .25. Items correlating below .25 will be revised for
improvement.

TABLE 1. INTERNAL ITEM VALIDITY COEFFICIENTS FOR
THE FIFTY ITEMS OF THE FORCED-CHOICE A-SCALE

Range of Coefficient    Frequency
.75 - 1.00              0
.50 -  .74
.25 -  .49
.00 -  .24


1Now at Human Research Unit 3, Fort Benning, Georgia. 
Surprisingly, the score distribution of the present sample differed significantly
from those of Heineman and Hammock (Table 2; Figure 1). Reliability coefficients 
of the three samples are comparable (Table 2). 


TABLE 2. MEANS AND RELIABILITY COEFFICIENTS OF THE FORCED-CHOICE A-SCALE FOR
THREE SAMPLES

Sample       N      Mean     r      SD
Heineman     206    25.5            6.22
                                            < .001
Hammock
                                            < .001
Hammes       302

*Kuder-Richardson Formula 21.
**Odd-even split half.





It is unlikely that Heineman’s sample and the present sample differ so greatly in
manifest anxiety, since both were drawn from introductory classes in psychology. 
Since the present sample achieved lower scores on Hammock’s A-Scale than did 
Army trainees, it might be presumed that Heineman’s sample would have also. Have 
college students, then, less manifest anxiety than Army trainees? There is another 
possible explanation for score differences, in terms of Hammock’s revision procedures. 
The forced-choice blocks of items in Hammock’s scale were constructed on the basis 
of item favorability ratings made by Army cadre, whereas Heineman used ratings by 
college students. Perhaps college students are more alert to favorability bias than 
are Army personnel and consequently achieve lower scores on a scale prepared on the
basis of ratings by Army personnel than they would on one involving ratings by
college students. The two scales would consequently not be comparable in score
value. In summary, the difference between scores of the Hammock sample and the
present sample may be due to (a) differences in manifest anxiety, or (b) differences in
scale construction, or (c) possibly both factors. The question, then, of equivalence of
manifest anxiety in college and Army samples remains open.


Fig. 1. FREQUENCY POLYGONS OF FORCED-CHOICE A-SCORES FOR TWO SAMPLES.


SUMMARY AND CONCLUSIONS 
An internal consistency analysis was carried out on the Hammock version of the 
Heineman Forced-Choice Anxiety Scale, using the upper and lower 27 percent of a 
sample consisting of 302 college students. It was found that the forced-choice form 
of the true-false anxiety scale has relatively high internal consistency, and is therefore 
a desirable format for such a questionnaire. It was also found that the equivalence 
of manifest anxiety in college and Army samples is questionable.
FORMBOARD COMPARABILITY IN THE COLORED
PROGRESSIVE MATRICES 


THOMAS E. JORDAN 


Washington University 


PROBLEM 

Several studies @: *: ©) have suggested that the Colored Progressive Matrices“, a 
measure of intellectual ability, has considerable value. Clinical application of the 
scale has led to consideration of the value of the incomplete designs, which are the 
substance of the scale, in another context; specifically, the provision of kinaesthetic 
and depth cues in the format of a formboard. Such a medium would also provide a 
slightly more provocative basis for the subjective estimates of problem solving tech- 
niques which are a valuable adjunct to the normal, objective statement of test re- 
sults. The purpose of the study was to evaluate the comparability of a formboard 
version of the Colored Progressive Matrices to the conventional booklet version of 
the scale. 




PROCEDURES 


Thirty-six formboards were constructed, identical in all respects except that 
(a) the incomplete design was recessed to a depth of one-eighth of an inch, and (b) 
the possible choices to complete the design were made of wood, and all were of such a 
shape and size as to fit the recess. 

Two groups (N = 49) of school children were derived randomly and matched 
to provide groups comparable by mental age. One group was given the formboard 
scale, the other group took the booklet version. 
RESULTS
Table 1 presents CA’s, MA’s, IQ’s and comparable booklet and formboard raw
scores. With a t-value of 1.51, the difference between the means of the booklet and
formboard scores does not approach significance. The coefficients of reliability (split
half) were .698 for the booklet form and .675 for the formboard. The coefficients of
validity, defined as Pearson r against Stanford-Binet MA’s, were -.033 for the
booklet form and .203 for the formboard, both quite low and strikingly at variance
with results of other investigators.


TABLE 1.

                         Range     Mean
CA (Months)
  Group I                99-111    101.60
  Group II               97-108    102.24
MA (Months)
  Group I                98-138    116
  Group II               98-146    120
IQ
  Group I                93-127    110.‘ 9.
  Group II               91-138    114. 6¢ .70
Booklet Raw Scores       16-28     31.7 .80
Formboard Raw Scores     18-30     2% 3.29
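
The reliability and validity coefficients reported above are both Pearson correlations, and a split-half coefficient of this kind can be sketched as follows. This is an illustration under assumptions, not the author's procedure: the odd-even split, the function names, and the small score matrix are hypothetical, and the source does not say whether a Spearman-Brown correction was applied to the reported values.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def split_half(item_scores):
    """Correlate each child's score on odd-numbered items with the even-numbered items."""
    odd = [sum(row[0::2]) for row in item_scores]    # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in item_scores]   # items 2, 4, 6, ...
    return pearson_r(odd, even)


def spearman_brown(r_half):
    """Estimate full-length reliability from a half-test correlation."""
    return 2 * r_half / (1 + r_half)


# Hypothetical data: 4 children by 4 items, 1 = item passed.
scores = [[1, 1, 1, 1], [1, 1, 0, 0], [0, 0, 0, 0], [1, 0, 1, 0]]
print(round(split_half(scores), 3))

# If .698 were an uncorrected half-test correlation (an assumption),
# the stepped-up estimate would be:
print(round(spearman_brown(0.698), 3))
```

A validity coefficient in the sense used here would be `pearson_r` applied to raw scores and Stanford-Binet MA's.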


SUMMARY 


This study compared test scores on the booklet form of the Coloured Progressive 
Matrices with a new formboard version. The reliabilities of the two forms were 
comparable, and the difference between the mean raw scores of the two forms did not 
approach significance, so it is concluded that booklet and formboard versions are 
comparable. Both scales produced low correlations against Stanford-Binet scores. 
A SHORT FORM OF THE WECHSLER INTELLIGENCE SCALE
FOR CHILDREN 


WILLIAM H. SIMPSON AND CECIL C. BRIDGES, JR. 
Oklahoma City Public Schools 


The particular combination of subtests used in the present study (Vocabulary 
and Block Design) was chosen because a sum of the scores of these two subtests 
showed high correlation with total weighted scores in the three age groups for which 
Wechsler“? presents tables of intercorrelations, using McNemar’s formula®?. The 
sample in the present study consisted of 120 children referred to the Division of Child 
Guidance of the Oklahoma City Public Schools. This sample is fairly typical of the 
population of children who come to the attention of social agencies and child guid- 
ance clinics. Table 1 presents descriptive data concerning general intelligence and 
chronological age of the subjects in this sample. 

TABLE 1. MEAN, STANDARD DEVIATION, AND RANGE FOR CHRONOLOGICAL AGE
AND WISC FULL SCALE IQ FOR THE SUBJECTS

Factor                     Range     Mean      SD
Chronological Age (mo.)    65-192    124.23
Full Scale IQ              54-142    95.10     15.87








The correlation of the Vocabulary-Block Design sum with the full weighted 
score was .874. The obtained relationship seems strong enough to warrant the use 
of a short form composed of these two tests as a screening instrument. When con- 
trasted with results of studies of short forms of the Wechsler Bellevue Intelligence 
Scale, the present findings are at least as favorable as those obtained with the 
WBIS®. Table 2, calculated from the regression equation and the WISC table 
for converting full weighted scores to IQ’s, is given for converting the sum of the
weighted scores of Block Design and Vocabulary tests to an estimated full-scale IQ. 

TABLE 2. CONVERSION OF SUM OF WEIGHTED SCORES ON BLOCK DESIGN AND VOCABULARY
TO PREDICTED FULL SCALE IQ

Blocks &       Predicted
Vocabulary     Full Scale IQ
21             104
22             107
23             109
24             112
25             115
26             117
27             120
28             123
29             125
30             128
31             131
32             133
33             136
34             138
35             141
36             144
37             146
38             149
39             152
40             154
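
The surviving entries of the conversion table behave like a rounded straight line, which can be checked with a short least-squares fit. This is a sketch under assumptions: the twenty printed IQ values are taken to correspond to consecutive sums of 21 through 40 (only 21 and 22 are legible in the source), and the fitted line is a summary of the table, not the authors' regression equation, which was applied to full weighted scores before conversion to IQ.

```python
# Assumed pairing: consecutive sums 21..40 with the twenty IQ values as printed.
sums = list(range(21, 41))
iqs = [104, 107, 109, 112, 115, 117, 120, 123, 125, 128,
       131, 133, 136, 138, 141, 144, 146, 149, 152, 154]

n = len(sums)
mean_x = sum(sums) / n
mean_y = sum(iqs) / n

# Ordinary least-squares slope and intercept.
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(sums, iqs))
         / sum((x - mean_x) ** 2 for x in sums))
intercept = mean_y - slope * mean_x


def predicted_iq(weighted_sum):
    """Estimated Full Scale IQ from the fitted line, rounded to a whole number."""
    return round(intercept + slope * weighted_sum)


print(f"IQ is roughly {intercept:.1f} + {slope:.2f} x (Block Design + Vocabulary)")
largest_gap = max(abs(intercept + slope * s - iq) for s, iq in zip(sums, iqs))
print(f"largest gap between line and table: {largest_gap:.2f} IQ points")
```

Under these assumptions each additional weighted-score point corresponds to a little over two and a half IQ points.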
BASE RATE DATA ON RORSCHACH CARD REJECTION*
WALTER F. MC KEEVER AND ALVIN I. GERSTEIN 


University of Rochester 


PROBLEM AND PROCEDURE 


Although Rorschach card rejection is one of the more gross and easily denotable 
responses to the test, such behaviors have received little systematic research at- 
tention. The studies of Mensh and Matarazzo? and of Sisson et al.“’, have pro- 
vided the only substantial data on card rejection. The present study, utilizing a 
large group of Rorschach protocols, aims primarily at determining if there are significantly
different rejection frequencies between the cards. In addition, the relationships
between rejection and reaction times and IQ, respectively, will be assessed.

All cases having been administered Rorschachs at the Rochester VA Mental 
Hygiene Clinic were taken from the test files, and rejection frequencies were tab- 
ulated for the ten cards. The only criterion for accepting a record for study was that 
it have at least ten responses. Any card failing to evoke a response properly scorable 
in the genetic framework, as presented by Friedman“), was counted as a rejection. 

The rejection data are based on an N of 693 males of various diagnoses, this 
number representing the entire Clinic population as defined for study. In addition, 
424 of these patients had been given either Full Scale or prorated Wechsler-Bellevue 
I tests, permitting assessment of the relationship between IQ and card rejection. The
mean IQ of these patients was 106.03, and the mean number of Rorschach responses
for the entire group was 21.8. A total of 79 records had fewer than 10 responses and are
not included in the N of 693.


RESULTS AND DISCUSSION 


TABLE 1. NUMBERS AND PERCENTAGES OF REJECTIONS OF EACH RORSCHACH CARD

Cards                       I  II  III  IV  V  VI  VII  VIII  IX  X | Sum
Number of rejections        2 8 } ‘ ‘ 57 26 | 238
Percentage of rejections    , 9.2 6 3.¢ 2% 23.9 | 10.9 | 100


The essential base rate data are summarized in Tables 1 and 2. Accepting the 
5% level as the criterion for significance, individual card-to-card chi square analyses 
(corrected for small cell frequencies where necessary) reveal three distinct groupings 
of the cards, with no significant differences within groups. Thus, cards IX, VI, and
VII are all rejected significantly more often than cards X, II, or IV. Each of these
six in turn is rejected significantly more often than III, VIII, or I. Card V, while not 
falling within any of these groups, is significantly different from all cards of greater 
rejection frequencies, except card IV. In the population studied, rejection behavior 
is not uncommon, since essentially 25% of the group reject cards. The rejection of 
certain cards, however, such as I, III, and VIII is so infrequent as to be considered 
markedly deviant. From the multiple-rejection data summarized in Table 2, it can 
be seen that the rejection of more than two cards is extremely rare, and no one
rejects more than four cards. 
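
A card-to-card chi square of the kind described above can be sketched for the two cards whose counts survive legibly in Table 1: card IX (57 rejections) and card X (26 rejections). This is an illustration under assumptions, not the authors' exact analysis: each card is treated as having been seen by all 693 patients, the two cards are set up as an ordinary 2 x 2 table even though the same patients saw both, and the function name is hypothetical.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Chi square for the 2 x 2 table [[a, b], [c, d]], with optional Yates correction."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Continuity correction: shrink the cross-product difference by n/2.
        diff = max(0.0, diff - n / 2)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


N_PATIENTS = 693          # from the text
rejected_ix, rejected_x = 57, 26   # from Table 1

plain = chi_square_2x2(rejected_ix, N_PATIENTS - rejected_ix,
                       rejected_x, N_PATIENTS - rejected_x)
corrected = chi_square_2x2(rejected_ix, N_PATIENTS - rejected_ix,
                           rejected_x, N_PATIENTS - rejected_x, yates=True)

CRITICAL_5_PERCENT_DF1 = 3.841   # tabled chi-square value for 1 degree of freedom
print(f"uncorrected: {plain:.2f}, Yates-corrected: {corrected:.2f}")
print("significant at 5%:", corrected > CRITICAL_5_PERCENT_DF1)
```

Both values exceed the 5% criterion, in line with the statement that card IX is rejected significantly more often than card X.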

*Appreciation is acknowledged to the staff of the Rochester Veteran’s Administration Clinic, 
Rochester, New York, for their cooperation in making this study possible. 





TABLE 2. NUMBERS AND PERCENTAGES OF PATIENTS REJECTING ZERO TO FOUR CARDS

Number of Rejections                                    Total
Number of Patients Rejecting                            693
Percentage of Patients Rejecting     75.% , 9 ,         99.9


Table 3 presents findings on the relationship of IQ to rejection behaviors. The
point biserial correlation between IQ and card rejection is -.12, significant at the
5% level. Non-rejectors are significantly more intelligent than either category of
rejectors.

TABLE 3. NUMBERS, MEAN IQS, STANDARD DEVIATIONS, AND t VALUES FOR THE
IQ DIFFERENCES FOR NON-REJECTORS VERSUS REJECTORS OF ONE CARD AND
MULTIPLE-REJECTORS

                      N     Mean IQ    SD
Non-Rejectors vs.     31    108.2      14.42
Rejectors of One            104.5      13.41
Multiple-Rejectors    3¢    103.3      16.39

*Significant at better than the 5% level.
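
The reported point biserial correlation can be checked for significance and converted to a proportion of variance with a few lines of Python. This is a sketch under one assumption: that the correlation of -.12 is based on the 424 patients for whom IQs were available. The function name is hypothetical.

```python
import math


def t_from_r(r, n):
    """t statistic for testing a correlation against zero, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1 - r * r))


r_pb = -0.12      # from the text
n_with_iq = 424   # assumed N for the correlation

t_value = t_from_r(r_pb, n_with_iq)
variance_explained = r_pb ** 2

print(f"t = {t_value:.2f} on {n_with_iq - 2} degrees of freedom")
print(f"proportion of variance accounted for: {variance_explained:.4f}")
```

The absolute t of about 2.5 exceeds the two-tailed 5% value of roughly 1.97 for this many degrees of freedom, while the squared correlation is only about .014.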

Rank order correlations between obtained rejection ranks and various reaction 
time and rejection ranks reported by other investigators": * 4 5 ® 7) are significant 
for all adult groups (5% level or better), and range from .75 to .97. Indeed, it ap- 
pears that rejection ranks correlate as highly with reaction time ranks as rejection 
ranks correlate among themselves. Clearly, those cards with the longest reaction 
times are those most frequently rejected. 

Chi square reveals no significant difference between the colored versus non- 
colored cards, while the difference between the 5 “broken” and 5 “unbroken’’ blots 
is significant at the 1% level. The latter finding is consistent with Friedman’s @
assumption that “unbroken”’ blots are perceptually more difficult to structure. 


CONCLUSIONS 

The results of the present study are consistent with those reported for rejection 
by other investigators’: © studying different clinical groups. Although IQ is nega- 
tively correlated with rejection, the relationship accounts for only 1% of the var- 
iance. Conceivably, tests of more specific abilities rather than the global one used 
here would show a greater relationship to rejection. That there is a significant differ- 
ence in rejections on the dimension of ‘‘brokenness”’ suggests that perceptual analyz- 
ing and integrating abilities might be two such factors. That other factors and inter- 
actions are probably involved in producing rejections is indicated since “brokenness” 
does not maximally differentiate the most from the least rejected cards. One might 
reasonably hypothesize that the observed rejection pattern is at least partially a 
function of an accumulation of reactive inhibition interacting with stimulus com- 
plexity, their effects being manifested among individuals within some critically low 
range of overall drive levels; such a ‘‘fatigue’”’ factor seems possible in view of the 
fact that 77% of all rejections occur on the last five cards. 

The consistently similar card rankings on rejection and reaction time suggest
that these variables have little diagnostic value in and of themselves, since the cor- 
relations are similarly high regardless of the clinical groups involved. This does not, 
of course, preclude the possibility that there are important personality correlates of 
rejection patterns. That children have a quite different rejection pattern “? suggests 
the possibility of developmental sequences of such patterns, which might serve as an
index of genetic maturity. 
MMPI PROFILES OF PARENTS OF CHILD PSYCHIATRIC PATIENTS¹
LEO J. HANVIK AND MILDRED BYRUM 


Washburn Memorial Clinic 
Minneapolis 5, Minnesota 


PROBLEM AND METHOD 
The staff of the Washburn Memorial Clinic, a child guidance clinic, have been 
using the MMPI routinely“ in the assessment of the children’s parents’ personalities.
The present study involves all of the profiles obtained on parents of children
referred to the clinic in a two year period. The total sample of profiles included 104
father profiles and 158 mother profiles (except that in the portion of the study dealing 
with the relationship between parent MMPI patterns and the children’s syndromes, 
the numbers were 96 fathers and 156 mothers). The larger number of mothers than of 
fathers resulted partly because a number of the children had no father in the home 
and partly because some of the fathers did not participate in the clinic treatment 
program or were not as cooperative as the mothers in meeting requests of the clinic. 

In the study two methods of profile analysis were used: (a) mean T scores were 
computed on all scales and (b) the profiles were all coded by the ranking method and 
the mean ranks for all scales were computed for the mothers and for the fathers. The 
mean profiles in Figs. 1 and 2 as well as the individual profile patterns were then in- 
spected and interpreted. In an effort to note possible relationships between parent 
MMPIs and child syndromes, the child of the parents studied was in each case 
classified according to a problem classification table previously devised at the clinic 
for another study. An analysis was then carried out in an effort to determine what 
types of parent profile codes appeared to be associated in undue proportion in any
of the child problem categories. 

RESULTS 

Comparison of Profiles. Composite profiles for the parents are plotted in Fig. 1 together
with mean T scores on all scales for 158 mothers and 104 fathers, showing that
both the mothers and fathers, as groups, tend to score above average on the Hy, Pd,
Pa, Pt and Sc, with the strongest trends showing up on the Hy and Pd scales. Fig.
2 shows that in terms of relative scale relationships or of prominence of position
within the individual profiles, the Hy and Pd scales occupy the first and second
positions respectively in both the mothers’ and fathers’ profiles, omitting the Mf
scale for the moment. The predominance of Hy and Pd scale elements reflected in
the above mean T scores and mean scale rank patterns of the parents’ MMPIs was
readily observable also in an inspection of the individual profiles. For example, the
“conversion V” configuration² was found to be present in 24% of the fathers’ profiles
and in 33% of the mothers’ profiles. With regard to Pd scale prominence in individual
profiles, 13% of the fathers and 18% of the mothers had Pd scale T scores
over 70. Although the statistical significance of the above incidences is undeterminable,
they are quite obviously high and bear out the heavy weighting of Pd and Hy
scale elements in Figs. 1 and 2 illustrating mean profile T scores and ranks.

¹The authors are indebted to Mr. William H. Dressler and to other members of the social work
staff of the Washburn Memorial Clinic for their work in obtaining MMPI profiles on parents as a part
of the clinic’s intake process. The authors also wish to express their indebtedness to Dr. Mollie
MacEachern, also of the Washburn Memorial Clinic, for her advice and other assistance in this study.

A striking feature of Figs. 1 and 2, and also discernible upon inspection of the
individual profiles, was the Pd minus Ma differential which in some individuals,
particularly mothers, was extremely high. (In one mother’s profile, for example, the
Pd score was 49 T score points higher than the Ma score. This mother, incidentally,
showed no “acting out” trends of a generalized sort but had a severe conflict with
both her husband and child.) The Pd minus Ma differential was significantly greater
in the mothers than in the fathers, in both samples being significantly greater than
zero.³ The relatively high Pd score and relatively low Ma score in parents’ profiles
has been of interest to our staff because it seemed to occur so frequently in situationally
maladjusted individuals reacting adversely and with intense hostility to a portion
of the social environment, i.e., persons with severe marriage problems. Nelson“ has
suggested a 15 point or greater Pd-Ma differential as pathological on the basis of
work with adult psychiatric patients who, though not acting out hostility in a generalized
“psychopathic” fashion in the community, nevertheless are feeling and
acting out hostility in a more specific fashion, usually against a family member. In
our parent sample, we also frequently noted elevations on the Si and D scales in
profiles with high Pd and low Ma scores, specifically in several parents presenting the
most severe marital conflict problems.


FIG. 1. THE MEAN T SCORES ON MMPI SCALES FOR 158 MOTHERS AND 104 FATHERS

Mean Scores (Men)      1§ 56 5! é 5 56 55 58 51 # 5i1
Mean Scores (Women)    50 55 §& AS 47 52 55 50 538 538 52


²A conversion V was defined for our purposes as a profile in which the Hs scale and Hy scale both
yielded T scores at least half a standard deviation above the T score on the D scale. This pattern
previously @: 5) has been shown to occur in persons who use repression as a defense mechanism and who
manifest anxiety in dissociated or “conversion” symptom patterns.

³The mean Pd minus Ma differential for the fathers group (N = 103) was 3.8 with a S.D. of 11.4.
The mean Pd minus Ma differential for the mothers group (N = 157) was 9.7 with a S.D. of 11.3. The
t for the difference between these means was 4.7 with a probability of less than .001. The fathers mean
Pd minus Ma differential was significantly different from zero with a t of 3.38 which has a probability
of less than .001.
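
The t values given in the footnote on the Pd minus Ma differential can be recomputed from the printed means, standard deviations, and group sizes. The sketch below is illustrative only: the function names are hypothetical, and the two-group test assumes a pooled variance estimate, which the source does not specify.

```python
import math


def one_sample_t(mean, sd, n):
    """t for testing whether a group mean differs from zero."""
    return mean / (sd / math.sqrt(n))


def two_sample_t(mean1, sd1, n1, mean2, sd2, n2):
    """Pooled-variance t for the difference between two independent group means."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error


# Summary statistics as printed in the footnote.
fathers = dict(mean=3.8, sd=11.4, n=103)
mothers = dict(mean=9.7, sd=11.3, n=157)

t_fathers = one_sample_t(**fathers)
t_mothers = one_sample_t(**mothers)
t_difference = two_sample_t(mothers["mean"], mothers["sd"], mothers["n"],
                            fathers["mean"], fathers["sd"], fathers["n"])

print(f"fathers vs. zero: t = {t_fathers:.2f}")
print(f"mothers vs. zero: t = {t_mothers:.2f}")
print(f"mothers vs. fathers: t = {t_difference:.2f}")
```

The fathers' value reproduces the printed 3.38. The recomputed value for the difference between groups comes out near 4.1 from these rounded figures, somewhat below the printed 4.7; either figure corresponds to a probability of less than .001.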

FIG. 2. THE MEAN RANK PROFILES FOR 158 MOTHERS AND 104 FATHERS ILLUSTRATING THE RELATIVE
PROMINENCE OF POSITION OF THE VARIOUS MMPI SCALES.

Mean Ranks (Men)       : 5.4 4.0 4.4 3.9 5.8 df ). ) 7.4
Mean Ranks (Women)     §.2 3.5 43 6.6 5.0. 58.! 5. y 5.9


Figs. 1 and 2 present some interesting contrasts between mothers and fathers on 
the mean T scores and mean ranks on the Mf, Ma and Si scales, where all the mean 
curves show an opposite tendency of the father vs. the mother group. 

The fathers as a group tend to have elevated Mf scores whereas the mothers 
tend to have depressed Mf scores. The clinic staff noted repeatedly that many of the
mothers with low Mf scores (either absolutely or in relative terms within the client’s 
own profile) were rather disturbed persons and especially difficult to influence thera- 
peutically. To check the impression of low Mf as a disturbance index, we compared 
the frequency of Mf scores below 40 among the following groups of profiles: (a) those 
with one or more clinical scales (exclusive of the Mf scale) in the above 65 T score 
range, (b) those with one or more clinical scales in the 61 to 65 T score range, and (c) 
those wherein the clinical scales were all 60 or below. The tabulated frequencies were 
compared to ascertain the association of low Mf scores with higher elevation profiles
with expected frequency by means of Chi Square which turned out to be 19.7,
statistically significant and with a probability of less than .01. Our interpretation
from this comparison was that mothers with the most elevated profiles tended more 
frequently to have Mf scores below 40 than did mothers with less elevated profiles, 
confirming previous impressions that low Mf scores in women are themselves signs 
of emotional disturbance and tend to be associated with high scoring tendencies on 
other MMPI scales. The fathers, on the other hand, have high Mf tendencies and 
there is no evidence of the fathers’ Mf scores being associated with either clinical or 
MMPI signs of emotional disturbance. 
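
The probability attached to the Chi Square of 19.7 can be checked directly if the comparison is taken as three profile groups by two Mf categories (below 40 or not), giving 2 degrees of freedom. That layout is an assumption, since the source does not print the table. For 2 degrees of freedom the upper-tail probability has the closed form exp(-x/2), so no statistical library is needed.

```python
import math


def chi_square_p_df2(chi_square):
    """Upper-tail probability of a chi-square value with exactly 2 degrees of freedom."""
    return math.exp(-chi_square / 2)


def chi_square_critical_df2(alpha):
    """Chi-square value cut off by upper-tail probability `alpha`, 2 degrees of freedom."""
    return -2 * math.log(alpha)


reported = 19.7   # from the text
p_value = chi_square_p_df2(reported)

print(f"p = {p_value:.6f}")
print(f"critical value at .01: {chi_square_critical_df2(0.01):.2f}")
```

Under this assumption the reported value lies well beyond the .01 critical value of about 9.21.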
Clinical Classifications. Each child of the parents in this study was classified accord- 
ing to a classification system including the following primary problem categories: 

1. Rebelliousness at home or school

2. Chronic anxiety or fear reactions (excluding school phobia syndrome)

3. Habitual manipulations

4. Delinquency (adjudicated)

5. Primary learning problem or school underachievement

6. Primary peer relationship problem

7. School phobia syndrome

8. Psychosomatic complaints

9. Chronic withdrawal

10. Enuresis (as primary presenting complaint at the clinic)

11. Encopresis (as primary presenting complaint at the clinic)

12. Chronic sleep disturbance


Only rarely was the primary problem the sole complaint, but rather in most instances 
was the problem of paramount concern of the parent in bringing the child to the 
clinic. To discover possible relationships between the parent MMPI findings and 
their children’s problem areas, the authors jointly studied the child’s case record in 
each case and arrived at a joint diagnostic sorting or classification of each child.
From this work it appears tentatively that parent MMPI profile patterns or elevations
were associated with particular syndromes or presenting complaints in the
child patients. There is a tendency for the mothers who have Hy as their highest 
scale and the Mf scale as their own first or second low point to have their children 
classified in category 1 in the scheme, i.e., with a main problem of “rebelliousness at
home or school’’. Chi Square is 7.008, significant at the 5% probability level before 
Yates correction for continuity but just missing the 5% level after the Yates cor- 
rection. The relationship between high Hy scores alone or low Mf scores alone as 
mothers’ MMPI features and the child’s membership in category 1 just misses 
significance at the 5% level even without application of Yates correction. (The low 
Mf score had previously been seen to occur frequently in the mothers’ profiles as 
had been a high Hy score, hence, these two features were combined and studied as a 
cluster.) 

In addition to the relationship between high Hy scale score and low Mf scale 
score and category 1 in the child problem areas, there also appeared to be a strong 
tendency for children with a diagnosis of ‘“‘school phobia syndrome” to have mothers 
with MMPI codes beginning with 2, i.e., with D as the high scale within the profile,
although Chi Square again just misses the 5% level. No strong relationships of any 
sort appeared between the fathers’ MMPI profiles and children’s problems. 


SUMMARY 
The MMPI was administered routinely to parents of children applying for
treatment at a psychiatric clinic for children. Deviations from the normal appeared 
in the parents’ profiles and there appears to be some association between certain 
parent profiles and certain types of presenting complaints in children. The various 
features of child psychiatric patients’ parents’ MMPI profiles are discussed in detail. 


‘This correction ®) has the function of correcting for small theoretical cell frequencies and is 
especially recommended when the Chi Square without correcting comes out just above customary 
confidence levels. 
CHANGES IN RECOGNITION THRESHOLDS ASSOCIATED WITH
CHLORPROMAZINE, PROMAZINE AND PHENOBARBITAL¹


HERMAN Y. EFRON 


Veterans Administration Hospital, Lyons, New Jersey 


PROBLEM 

This study attempts to determine whether for schizophrenic patients there
would be any consistent changes in recognition threshold of tachistoscopically presented
simple stimuli (capital letters) as a result of treatment with two phenothiazines
(chlorpromazine and promazine) and phenobarbital. Primac, Mirsky and
Rosvold ‘ found that with normal subjects using the Continuous Performance Test 
there was a significant decrement following administration of chlorpromazine. 
Furthermore, using three dosage levels (100, 200, and 400 mg.), they found that 
higher dosages led to greater impairment than lower dosages. Similarly, using a wide 
variety of psychological tests and with normal subjects, Kornetsky, Humphries, and 
Evarts®? found that secobarbital (100 and 200 mg.) and chlorpromazine (100, 200, 
and 400 mg.) significantly impaired performance. Again under higher dosage there 
was greater impairment than with lower dosages. When their subjects were required 
to discriminate between the size of two tachistoscopically presented circles the group 
receiving 200 mg. of secobarbital was more impaired than those given 200 mg. of 
chlorpromazine who in turn were more impaired than those given 100 mg. of seco- 
barbital. On the other hand Lehmann and Hanrahan“? using normal subjects found 
no decrement in reaction time, hand steadiness, digit span, and digit symbol under 
dosages of chlorpromazine ‘“‘sufficiently large to produce somnolence.’’ With 3 gr. 
secobarbital they found uniform impairment in all tested functions. Lehmann and 
Csank “? using normal subjects found that chlorpromazine (100 - 150 mg.) produced 
a decrease in flicker fusion and tapping speed. All of the above experiments dealt 
with nonpsychiatric subjects and with performance after having been given a single 
or at most a few dosages of the drug under study. 

Concerning the effects of chlorpromazine taken for a longer period of time,
Winter and Frederickson “*? found that it leads to “some disruption of coordination”
particularly in the initial phases of treatment. They caution that in evaluating
performance on tests involving visual-motor coordination, e.g., the Bender-Gestalt,
account should be taken of concurrent drug therapy. However, their study only
covered a one week period and there was no gradual build-up to the maximum dosage
of 50 mg. q.i.d. Using the maze test, Porteus“? and Porteus and Barclay“? found
decrements in performance following treatment with chlorpromazine (300 mg. for
six weeks to six months), but there have been at least two studies where these results
were not duplicated“: *). None of the previous studies included testing of recognition
thresholds. The present study was designed to provide data on a basic psychological
effect of prolonged treatment with the drugs utilized. Because of the empirical nature
of the investigation the null hypothesis, that there are no differences between the
effects of the various drugs, was formulated.


¹This study was conducted as an individual hospital project of the Psychiatric Evaluation Project,
Richard L. Jenkins, M. D., Director. It was done in conjunction with the Veterans Administration
Cooperative Studies of Chemotherapy in Psychiatry Projects 1 and 2, which are described in detail
elsewhere “°). This paper was reviewed in the Veterans Administration and is published with the
approval of the Chief Medical Director. The statements and conclusions of the author are the result
of his own study and do not necessarily reflect the opinion or policy of the Veterans Administration.


METHOD 

Subjects. Forty nonlobotomized male patients, less than 50 years old and with a diagnosis of 
schizophrenia were the subjects in this study. The mean age was 39 ± 8 and the mean length of
current hospitalization was 103 ± 104 months (Mdn = 109, Q1 = 11, Q3 = 144). No patient
included was involved in individual or group psychotherapy, was receiving insulin or electrocon- 
vulsive treatment, or had any disease or illness which contraindicated the use of any of the drugs 
to be administered. Administrative decisions determined the number of patients from each ward 
to be included but after the quotas had been established patients were randomly selected from 
each of the wards. 

Drugs. In order to vitiate the effects of any medication the patients were receiving at the 
time of selection, all tranquilizing medication was discontinued for at least one to two months 
prior to the initiation of the experimental medication. The drug schedule was: chlorpromazine 
200 mg. b.i.d., promazine 200 mg. b.i.d., phenobarbital 1½ gr. b.i.d., or placebo b.i.d. To lessen
the initial impact of the drugs there was a five day gradual build-up period before reaching maximum
dosage. The drugs were supplied by the Central NP Research Unit of the Veterans Administration
in identical capsules. No one at the hospital was aware of what drug each patient was
receiving. After twelve weeks on the drugs there was a randomized cross-over design2 in which
two-thirds of the patients changed drugs and one-third were maintained on the same drug. 

Apparatus. A constant illumination projection tachistoscope was used. Duration of ex- 
posure (10 milliseconds) and ambient illumination (70 footlamberts incident light at the screen 
and at the subject) were held constant and brightness was varied by a rheostat which supplied 
current to the lamp of the projector that contained the stimulus material. Initially the subject 
was presented with a trial series of six numbers in order to acclimate him to the task. The experimental
series consisted of 10 capital letters projected onto a 21” by 13” beaded screen. Each
number or letter was projected oes times at the same brightness and if they were not correctly 
identified the rheostat was advanced two volts. Brightness level for the first presentation was the 
same for all subjects. When the subject identified a letter on two consecutive exposures the next 
letter was presented. 

Testing Schedule. Recognition thresholds for the 10 stimulus letters were determined for each 
subject prior to initiation of the experimental medication, after 10-12 weeks on the first drug and 
for a third time 10-12 weeks after the randomized cross-over of medication. 


RESULTS AND DISCUSSION 

Of the 40 patients, 27 were in sufficiently good contact and sufficiently coopera- 
tive to complete at least two of the three testing sessions. Comparisons were possible 
between a patient’s performance after having been on the first drug, and after having 
been on the second drug with his baseline before receiving medication. If a patient 
received the same drug during both parts of the study only his performance at the 
end of the first 10-12 weeks was considered. Using this method of analysis, the 
number of comparisons between a subject’s pre-drug performance and his perform- 
ance after having been on one of the experimental drugs was as follows: chlorpro- 
mazine = 11, promazine = 11, phenobarbital = 9, placebo = 9. 

The recognition threshold for each letter was taken as the mean of the rheostat
settings (voltages) for the two consecutive correct identifications. For each testing
session the median recognition threshold was then computed. Mean scores could not
be obtained since there were a few instances where the subject recognized the letter
correctly on the first two presentations and other instances where the threshold was
so high its exact value was not determined. Using the difference between the median
score of the pre-drug testing and the testing under the drug conditions the following
comparisons were made: chlorpromazine vs. promazine, chlorpromazine vs. phenobarbital,
chlorpromazine vs. placebo, promazine vs. phenobarbital, promazine vs.
placebo, phenobarbital vs. placebo, chlorpromazine and promazine vs. phenobarbital
and placebo, chlorpromazine and promazine vs. phenobarbital, and chlorpromazine
and promazine vs. placebo. Statistical analysis by means of the Mann-Whitney U
test (pp. 116-127) indicated that none of the differences were significant at the 5% level.

2For administrative reasons 8 of the 40 patients were discontinued after the first 12 week period.
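
As a rough illustration of the scoring and comparison just described, the following Python sketch computes a session median from per-letter thresholds and a Mann-Whitney U statistic for two sets of change scores. Every number is invented for illustration. Representing a threshold that was never reached as `math.inf`, and one at or below the starting brightness as `-math.inf`, is an assumption made here and not the authors' stated procedure; it is meant only to show why a median remains defined when a few thresholds are indeterminate.

```python
import math
from statistics import median


def letter_threshold(first_correct_volts, second_correct_volts):
    """Mean rheostat setting over the two consecutive correct identifications."""
    return (first_correct_volts + second_correct_volts) / 2


def session_median(thresholds):
    """Median of the per-letter thresholds for one testing session."""
    return median(thresholds)


def mann_whitney_u(x, y):
    """U for sample x: each pair with x_i > y_j counts 1, each tie counts 0.5."""
    u = 0.0
    for a in x:
        for b in y:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


# Hypothetical session: ten letters, one at the starting level, one never identified.
pre_drug = [-math.inf, 40, 42, 44, letter_threshold(46, 46), 48, 50, 52, 54, math.inf]
pre_median = session_median(pre_drug)

# Hypothetical change scores (drug-session median minus pre-drug median), in volts.
drug_a_changes = [1, 2, 3]
drug_b_changes = [4, 5, 6]
u_a = mann_whitney_u(drug_a_changes, drug_b_changes)
u_b = mann_whitney_u(drug_b_changes, drug_a_changes)
print(pre_median, u_a, u_b)
```

In practice the smaller of the two U values would be referred to a table of critical values for the two sample sizes; that lookup is not shown.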

Our results indicate that for the drugs and dosages used there was no interference
with the perceptual process tested, and suggest that there is a difference between
the initial effects of the drugs and their effects after prolonged usage. Our results
are not consistent with the findings of Whitehead and Thune, who report
greater cooperation in their chlorpromazine group. Of the four patients who cooperated
for the initial testing and subsequently refused to cooperate, three were
receiving chlorpromazine.

The fact that the phenobarbital group did not show any impairment is somewhat
surprising in view of previous studies. Although those studies used a different
barbiturate, it would seem that 1½ gr. b.i.d. of phenobarbital would at least be equivalent
to the dosage of Seconal (100 and 200 mg.) used by Kornetsky et al. The most
parsimonious explanation would seem to be that a tolerance is built up with prolonged
usage of the drug.


SUMMARY 
Comparisons were made between the effects of chlorpromazine, promazine, and
phenobarbital on schizophrenic patients’ recognition thresholds of tachistoscopically
presented simple stimuli (capital letters). Forty subjects were placed on one of the
experimental drugs or a placebo and at the end of 12 weeks there was a randomized
cross-over in which two-thirds of the patients changed medication. Results showed
that there were no significant changes in recognition thresholds following treatment
with the drugs. Viewed in conjunction with previous studies, this finding suggests that
impairment of functioning observed at the initiation of drug treatment may not
persist when treatment is prolonged.


EFFECTS OF MORPHINE AND PENTOBARBITAL
ON DIFFERENTIAL MMPI PROFILES

Addiction Research Center, PHS, Lexington, Kentucky


PROBLEM 

Evidence that altering certain incentives modifies the behavioral effects of
morphine and pentobarbital was presented in a previous report. The present
study was similar in formulation but was specifically concerned with possible
correlations between drug effects and personality characteristics as measured by the
MMPI. That this inventory may be useful in assessing drug effects has been
demonstrated by Belleville in testing LSD-25, and by Isbell, et al., and Abramson
in using alcohol. No mention was made in these reports of a possible
interaction between personality and drug effect. Some support for such an interaction,
however, is supplied by the studies of Lasagna and Beecher and by Felsinger, et al.,
who reported that the 7 of their group of 20 normal subjects who “responded
unequivocally with euphoria to heroin or morphine, or both,” were the most immature
of the group as well as being self-centered, impatient, and above the median in
“anxiety, hostility content.”

These studies demonstrated that individuals within a “normal” population may
respond differently to the same drug. Furthermore, clinical observation suggests
that even within a relatively homogeneous group of drug addicts differential effects of
addicting drugs may be expected. The purpose of the present study, then, was to
investigate systematically the effects of acute administration of morphine and
pentobarbital on responses to the MMPI, and, within the limits of this instrument, to
assess the degree to which drug effects are dependent upon personality characteristics.

METHOD 


Subjects. Fifty-five former addicts serving sentences for violation of the Harrison Narcotic Act 
volunteered for the study. Approximately equal numbers of white and negro subjects were used. 
The group form of the MMPI was given in balanced order under control, morphine, and pento- 
barbital conditions. Only one test per week was given to avoid residual drug effects. Two intra- 
muscular injections of morphine (30 mg. each) or of pentobarbital (100 mg. each) were given (9:30 
a. m. and 1:30 p. m.). The MMPI was administered at 2:30 p. m. the same day. Standard in- 
structions were given for completing the inventory, except that the subjects were told to respond 
according to the way they felt on the day of the test. An informal atmosphere was encouraged 
and the men were allowed to smoke and make comments to each other. 

Present personality groupings were based upon a “high point” classification used for a
previous investigation in which 94 per cent of the population of addicts presented abnormal profiles
under drug-free conditions. To distinguish subsamples in this prior study, all abnormal
profiles were classified according to the two most elevated scales as falling into one of the following
subgroups: (a) Psychopathic group: Profiles having Pd and Ma as the highest elevations; (b)
Neurotic group: Profiles having two highest points among the Hs, D, Hy, and Pt scales; (c)
Schizoid group: Profiles having two highest points among the Pa, Sc, Ma, and Pt scales; (d)
Unclassifiable group: Profiles that could not be classified by such high-point criteria. [The Pd scale
was disregarded except for group (a).] The generic term “Conduct Disorder,” indicating
variation in the direction of psychopathy, was applied to these four groups because the Pd scale was
elevated approximately two standard deviations in all these subsamples. For present purposes in
classifying drug addicts, however, the terms “Conduct Disorder, Drug Addict,” with the various
subgroupings, have been abbreviated to: Primary psychopath, Neurotic psychopath, Schizoid
psychopath, and Unclassifiable group.
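
The high-point rules above can be sketched in a few lines of Python. This is only one reading of the criteria: the function name, the example T-scores, and the handling of tied elevations (ties keep the order in which scales are listed) are assumptions introduced here, and the prior check that a profile is abnormally elevated is left out.

```python
NEUROTIC_SCALES = {"Hs", "D", "Hy", "Pt"}
SCHIZOID_SCALES = {"Pa", "Sc", "Ma", "Pt"}


def high_point_group(t_scores):
    """Classify one profile given a dict of scale name -> T-score."""
    # Scales from most to least elevated (ties keep dictionary order).
    ordered = sorted(t_scores, key=t_scores.get, reverse=True)
    # Group (a): Pd and Ma are the two highest elevations.
    if set(ordered[:2]) == {"Pd", "Ma"}:
        return "Psychopathic"
    # Pd is disregarded for every group other than (a).
    top_two = set([scale for scale in ordered if scale != "Pd"][:2])
    if top_two <= NEUROTIC_SCALES:
        return "Neurotic"
    if top_two <= SCHIZOID_SCALES:
        return "Schizoid"
    return "Unclassifiable"


# Hypothetical T-scores for four profiles.
profile_a = {"Hs": 50, "D": 55, "Hy": 52, "Pd": 75, "Pa": 58, "Pt": 60, "Sc": 62, "Ma": 70}
profile_b = {"Hs": 70, "D": 72, "Hy": 60, "Pd": 74, "Pa": 55, "Pt": 58, "Sc": 56, "Ma": 54}
profile_c = {"Hs": 50, "D": 55, "Hy": 52, "Pd": 74, "Pa": 70, "Pt": 58, "Sc": 72, "Ma": 60}
profile_d = {"Hs": 50, "D": 72, "Hy": 52, "Pd": 74, "Pa": 55, "Pt": 58, "Sc": 70, "Ma": 54}

for profile in (profile_a, profile_b, profile_c, profile_d):
    print(high_point_group(profile))
```

Because Pt is the only scale shared by the Neurotic and Schizoid sets, a pair of high points cannot satisfy both rules at once, so the order of those two checks does not matter.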

Procedure. In the present study the eight clinical scales were ranked for each profile, using
Hathaway’s coding system. Rank order correlations were then computed between each subject’s
control MMPI and the mean rank orders of the three classes of addict profiles obtained
previously. Percentile ranks of correlation coefficients for all subjects within each classification
of addicts were then established. In this way all individuals were ranked for similarity of profile
in each of the three groups, and each subject was assigned to the group for which his percentile
rank was the highest. In case of a tie on percentile rank, the decision for placement was based on
the size of the correlation coefficient. If rho was less than .45, the profile was placed in an
unclassifiable group; as in the previous study, most of the profiles in this group were abnormally elevated
but could not be classified according to present criteria. Taylor’s Manifest Anxiety Scale (A),
Welsh’s Anxiety Index (AI), and Internalization Ratio (IR) were also used. The
Masculinity-femininity (Mf) scale was not employed.
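
The assignment rule in this paragraph can be sketched as follows. The sketch is hedged in several ways: the three prototype rank orders and the four subject profiles are invented, the percentile rank is taken as the percentage of subjects whose rho for that group is at or below the subject's own (the source does not give its formula), and the .45 cutoff is applied to the rho of the chosen group. Scale order in every list is Hs, D, Hy, Pd, Pa, Pt, Sc, Ma.

```python
import math


def average_ranks(values):
    """Ranks with 1 = lowest; tied values share the mean of their positions."""
    ordered = sorted(values)
    return [ordered.index(v) + (ordered.count(v) + 1) / 2 for v in values]


def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def spearman_rho(x, y):
    """Rank order correlation: Pearson r computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def percentile_rank(value, population):
    return 100.0 * sum(v <= value for v in population) / len(population)


def classify(profiles, prototypes, cutoff=0.45):
    rhos = {name: {g: spearman_rho(scores, proto) for g, proto in prototypes.items()}
            for name, scores in profiles.items()}
    assigned = {}
    for name, by_group in rhos.items():
        # Highest percentile rank wins; a tie is broken by the larger rho.
        best = max(by_group, key=lambda g: (
            percentile_rank(by_group[g], [r[g] for r in rhos.values()]), by_group[g]))
        assigned[name] = best if by_group[best] >= cutoff else "Unclassifiable"
    return assigned


# Hypothetical mean rank orders for the three classes (8 = most elevated scale).
prototypes = {
    "Primary": [2, 3, 1, 8, 4, 5, 6, 7],
    "Neurotic": [6, 8, 7, 5, 1, 4, 3, 2],
    "Schizoid": [1, 3, 2, 5, 6, 4, 8, 7],
}
# Hypothetical control-condition T-scores for four subjects.
profiles = {
    "a": [50, 52, 48, 75, 55, 58, 60, 70],
    "b": [68, 74, 70, 66, 48, 62, 58, 52],
    "c": [48, 54, 50, 62, 66, 58, 76, 70],
    "d": [80, 45, 60, 57, 75, 50, 54, 66],
}
groups = classify(profiles, prototypes)
print(groups)
```

Subject "d" correlates negatively with all three prototypes, so it falls below the cutoff whichever group ranks highest.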

A mixed analysis of variance design was used to test the effects of personality groupings,
scales, and drugs, and independent analyses of the same type were used to test changes in AI
and IR, as well as in F + Pd, a combination score designed to measure interpersonal behavior
which is generally recognized as being undesirable. A treatments × treatments × subjects design
was utilized to test drug and scale effects within each group.


RESULTS 
The means for all MMPI scales and indices for each of the subgroups are presented
in Table 1. Application of analysis of variance revealed significant effects due
to groups, scales, drugs X groups, as well as drugs X scales X groups interactions.1
However, it is noteworthy that no significant general effect of drugs was found.


TABLE 1. SUMMARY OF MEANS FOR ALL DRUG AND NONDRUG CONDITIONS

Ma AI

Primary Psychopath
Control
Morphine 4 ye , §2* 5& 5s : 5 55**
Pentobarbital . ) 5 5é 53 f - 49+

Neurotic Psychopath
Control 4S 61 6 rh De ¢ 54 66
Morphine 43 56 y 7 7 56 «62
Pentobarbital 8 62 Ye 7 ) 5¢ 64

Schizoid Psychopath
Control 46 68 5 56 =A Ne j 60
Morphine i 71 S 57 & 5: 7 61 60
Pentobarbital 46 66 f 2 56 f if 59 59

*Significantly different from control at <.05 level
**Significantly different from control at <.01 level
+Significantly different from morphine at <.05 level

The drugs, scales, and drugs X scales variances were significant for the Primary 
group. The significant drug effect for this group accounts for the drugs X groups 
interaction found in the analysis of the total sample, since drug effects alone were not 
significant for the Neurotic or the Schizoid groups. The drugs X scales effect in the 
Primary group was further traced to significant increases in the Hs, Pa, Pt, and Sc
scales from control to morphine conditions. The drugs X scales effect in the Neurotic 
subgroup of psychopaths was limited to a significant decrease in the Depression scale 
under morphine. No significant drug or drugs X scale effects were found in the 
Schizoid or Unclassified groups. 

The Anxiety Index and the Internalization Ratio were analyzed separately. As 
shown in Table 1, both were significantly decreased by morphine in the Neurotic 
group and both were significantly increased by this drug in the Primary psychopath 
group. Under pentobarbital, both the Neurotic and Schizoid groups showed signifi- 
cant decreases on IR. The Primary psychopath group was significantly higher on 
IR under morphine than under pentobarbital. Analysis of the F + Pd scores is not 
presented in tabular form. Increases in this measure were significant beyond the .01
level from control to both morphine and pentobarbital in the Primary
psychopath group, but the obtained decreases in the Neurotic group were nonsignificant.

Judging from the subjects’ behavior, the dose of morphine was higher than that 
required to produce a measurable effect on the MMPI. In spite of the large dose all 
completed the test, and although some vomited, this effect was described in the 
addicts’ jargon as a “good sick.” On the other hand, the dose of pentobarbital was
somewhat lower than could have been used for present purposes. 


1The table of analysis of variance can be obtained from the authors on request.


DISCUSSION


All drug effects obtained were dependent upon personality classification; no
significant changes on the MMPI scales were produced by the drugs before the subjects
were classified into personality groupings. It would thus seem that combining
test results from even moderately heterogeneous individuals may obscure some
psychological drug effects.

The significant decreases produced by morphine on the Anxiety Index and
Internalization Ratio in the Neurotic psychopath group are evidence, complementary to
clinical observations and experimental findings, that opiates reduce fear and anxiety.
The decrease in elevation of the Depression scale is similar evidence, and adds
considerable credence, from an operationally defined source, to the view that certain
subsamples of psychopaths and addicts have a strong depressive trend that opiates are
effective in ameliorating.

The only significant change found in the Schizoid psychopath group was a re- 
duction in the Internalization Ratio produced by pentobarbital. This is in accord 
with clinical observation and general belief that barbiturates relieve some forms of 
anxiety and enhance “externalization” of conflicts. However, the lack of significant
change on other indices and scales may have been due to the amount of drug given, 
the size of the sample, degree of sensitivity of the MMPI, or it may have been due 
to heterogeneity of error variance within the group. 

The increased elevation induced by morphine on the Hs, Pa, Pt, and Sc scales in 
the Primary psychopath group must be interpreted with caution. Since all scales 
except Pd and Ma were within the normal range under control conditions, as was 
found in a previous study, this group would not be considered either anxious or 
depressed. It would not be expected, then, that drugs would produce a decrease in 
the neurotic triad or indices of anxiety in the Primary group as was found for the 
Neurotic psychopath. Two alternative explanations, however, are suggested for the 
significant increases that were found. Morphine appears either to increase symptoms 
in the Primary psychopath which are represented by the altered scales and indices, or 
to change the individual so that he is more willing to admit such symptoms and 
behavior.

Until direct tests of these hypotheses are made, the latter possibility seems the
more reasonable in view of three lines of suggestive evidence: (a) Astin,2 applying
factor analysis to the Pd scale obtained on a group of addicts who were maintained in
a drug-free environment, reported that the Primary psychopath “responds in
overperfect ways to the MMPI items and denies any weakness in interpersonal
relations”; (b) clinical experience indicates that morphine reduces concern in
interpersonal relations; (c) the combination F + Pd, utilized in the present study as a
measure of social behavior generally recognized as being undesirable, was found to be
significantly increased in the Primary but not in the Neurotic psychopath. The
evidence is thus suggestive that the above-mentioned tendency toward “denial” in
the Primary group is reduced by opiates and that this alteration in motivation is
reflected in the endorsement of MMPI items.


SUMMARY 


The MMPI was given in balanced order to 55 postaddict subjects under control,
morphine (60 mg.), and pentobarbital (200 mg.) conditions. Drug effects were assessed for
the total group as well as for four subgroups: Primary psychopath (N = 12);
Neurotic psychopath (N = 12); Schizoid psychopath (N = 12); and Unclassifiable
(N = 19). The drugs did not alter scale elevations for the combined group; significant
drug effects were dependent upon differential personality characteristics as
measured by the MMPI. The only significant change under pentobarbital was a
decrease in the Internalization Ratio in the Schizoid and Neurotic groups. Morphine
produced unpredicted increases in Hs, Pa, Pt, Sc, and in the Internalization Ratio in
the Primary psychopath group, and produced predicted decreases in D, the Anxiety
Index, and Internalization Ratio in the Neurotic psychopath. The latter changes
were anticipated since it had been hypothesized that one important aspect in the
development of addiction to opiates is initial reduction of discomfort, anxiety, and
depression in those addicts who show neurotic tendencies. Although the results
suggest that opiates reduce sensitivity to interpersonal relations in the Primary
psychopath, the implications of the findings for addiction liability of the opiates in this type
of individual are not as yet clearly defined.

2Astin, A. A factor study of the MMPI Psychopathic deviate scale. J. consult. Psychol., in press.


INTELLECTUAL CHANGES FOLLOWING FRONTAL LOBE
PROCAINIZATION1
ALBERT F. PAOLINO AND IRA FRIEDMAN 


Cleveland Psychiatric Institute and Hospital 


PROBLEM 
Post-operative intellectual changes following psychosurgery have been reported 
by various investigators (1, 2, 3, 4). The present study is addressed to the problem of
intellectual changes following the psychosurgical (pharmacosurgical) technique of 
injecting procaine into the frontal lobes. 


PROCEDURE 
The sample consisted of 26 patients hospitalized at the Summit County Receiv- 
ing Hospital who received an injection of 20 to 30 cc. of 0.5 per cent procaine solution 
into each frontal lobe in the plane of the coronal suture. A control group of 26 
patients hospitalized at the Cleveland Psychiatric Institute and Hospital was 
matched for age, sex, initial IQ level, and diagnosis. Most of the patients were 
diagnosed as schizophrenic, 17 in the experimental and 18 in the control group. The
mean age for operatees was 37.73 years, S.D. of 10.55, and the mean age for the
control group was 37.61, S.D. of 9.81. The mean initial IQ level was 99.92, S.D. of
14.19, for the operatees and 99.73, S.D. of 14.96, for the control group. The control
group received no psychiatric treatment other than routine intake, admission, and
diagnostic interviews.2

1Modification of a paper presented at the meeting of the Ohio Psychological Association,
Cleveland, Ohio, November 1, 1958.

The Wechsler Bellevue Intelligence Scale was administered twice to each subject.
In the experimental group, approximately 10 days elapsed between the first and
second testing, and the second testing was completed within 48 hours after procaine
injection. The control group received its second test administration from one to
three weeks following the first. Three sequences of intelligence test administration
were employed with the two groups: Form I of the Wechsler Bellevue Intelligence
Scale for both initial and final testing with 11 patients in each group, Form I then
Form II with 8 patients in each group, and Form II then Form I with 7 patients in
each group.

The verbal, performance, and full scale IQs of the two groups were analyzed by 
means of the Mann-Whitney U-test, a non-parametric technique. The t-test was also 
used but it is of doubtful validity due to the skewed nature of the data or its lack of 
homogeneity of variance. 
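
The t-test mentioned above, when applied to the change from first to second testing within one group, can be read as a test on paired difference scores. The following Python sketch works under that assumption; the function name and the IQ scores are invented for illustration and are not the study's data.

```python
import math
from statistics import mean, stdev


def paired_t(first, second):
    """Mean change and t for paired scores, with n - 1 degrees of freedom."""
    diffs = [after - before for before, after in zip(first, second)]
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)  # stdev uses the n - 1 divisor
    return mean(diffs), mean(diffs) / standard_error


# Hypothetical first and second IQ scores for four subjects.
first_iq = [100, 100, 100, 100]
second_iq = [102, 104, 106, 108]
mean_change, t_value = paired_t(first_iq, second_iq)
print(mean_change, round(t_value, 2))
```

With skewed change scores or unequal variances, as the authors note for their data, a rank-based test is the safer choice and the t value should be treated as descriptive only.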


RESULTS AND DISCUSSION 

Table 1 presents the mean IQ changes of the experimental and control groups
along with the significance of these changes within each group from first to second
testing, as well as the significance of the changes between the experimental and
control groups.3 The results indicate that both groups reveal highly significant
increases in full scale IQ scores. These increases appear to be more related to increases
in performance or non-verbal functioning than to increases in the verbal area. In
addition, the differences tend to be greater and more striking in the experimental
than in the control group. However, when the changes in the two groups are compared,
the differences are not significant with the exception of two borderline .05
p-values suggesting greater increases in the experimental group in performance IQ
and greater increases in full scale IQ when identical tests are employed.


TABLE 1. SIGNIFICANCE OF THE IQ CHANGES WITHIN EXPERIMENTAL AND CONTROL GROUPS AND
BETWEEN EXPERIMENTAL AND CONTROL GROUPS

                        Experimental                      Control               Between groups
Test Order   N    Mean Change    t     p*     p**    Mean Change    t     p*     p**      ?      p

Full Scale IQ
Total        26     +4.88      5.22   .001   .001      +3.12      2.80     ?      ?       ?      NS
I-I          11     +5.82      4.23   .01    .01       +1.64        ?     NS     NS       ?     .05
I-II          8       ?        ?.88   .05    .05       +5.88      2.62   .05      ?       ?      NS
II-I          7     +2.57      ?.62   NS     NS        +2.29      1.02    NS     NS       ?      NS

Verbal Scale IQ
Total        26       ?          ?    NS     .05       +1.46       .90    NS     NS      .36     NS
I-I          11       ?        ?.22   NS     NS        -.45        .42    NS     NS      .28     NS
I-II          8       ?        ?.82   .05    .01       +4.50      1.03    NS     NS       ?      NS
II-I          7      -5        ?.82  -.01   -.01       +1.00       .46    NS     NS       ?      NS

Performance Scale IQ
Total        26     +7.62      6.20   .001   .001      +3.77      2.86   .01    .01       ?     .05
I-I          11     +8.45      6.39   .001   .01       +4.09      1.81    NS     NS       ?      NS
I-II          8     +5.00      1.99   NS     .05       +4.88      2.01    NS     NS       ?      NS
II-I          7     +9.29      3.19   .05    .05       +2.00       .92    NS     NS       ?      NS

*McNemar’s Method of Differences
**Non-parametric analysis, Wilcoxon Matched Pairs Signed Rank Test
? Entry illegible in the available copy.


2Five patients in the control group received a placebo taken orally in connection with another
research project.

3The analysis of individual subtests is not presented. There were no significant differences in any
of the subtest changes between experimental and control groups. This finding is in essential agreement
with those of King (3) although his controls showed significant improvement in Object Assembly at the
second post-testing period.

These results indicate clearly that intellectual functioning immediately following 
frontal lobe procainization reveals no evidence of deterioration. The only suggestion 
of intellectual impairment observed was in the 7 experimental subjects who displayed 
a mean verbal IQ loss of 5 points. These same subjects, however, had a performance
IQ gain of 9.29 points and a full scale IQ increase of 2.57 points.


SUMMARY AND CONCLUSIONS 

This study investigates intellectual changes following the psychosurgical 
(pharmacosurgical) technique of injection of procaine into the frontal lobes. A group 
of 26 operatees was tested with the Wechsler Bellevue Intelligence Scale before and 
after the operation and was compared with a control group matched as to age, sex, 
initial IQ and diagnosis. The results indicate that no intellectual impairment occurs
immediately following this operation, that significant practice effects are in evidence 
similar to those revealed by a matched control group, and that there is some sug- 
gestion that the operative procedure may result in greater increases in performance 
IQ than those attributable to practice effects. 


REFERENCES 

1. Dust, S. 8. and Taater, M. The use of psychological tests on schizophrenic patients before 
and after shock treatment. Amer. Psychologist, 1947, 2, 283. Abstract. 

2. Kwsster, Lucius. Intellectual changes in schizophrenic patients following electric shock 
therapy. Amer. Psychologist, 1948, 3, 277-278. 

3. King, H. Intellectual function. In F. A. Mettler (Ed.), Selective Partial Ablation of the Frontal
Cortex. New York: Paul B. Hoeber, 1949.

4. Porteus, S. D. and Peters, H. N. Maze test validation and psychosurgery. Genet. Psychol.
Monogr., 1947, 36.

5. Whitworth, J. M., Weinberger, L. M., Zrmpier, M. B., and Paolino, A. F. The use of
frontal lobe procaine injections in the treatment of mental disorder. Ohio St. med. J., 1956, 52,
727-728.


4This group of subjects was younger and less bright than the other experimental subjects. Further
comparisons, not presented here, indicated trends which suggest that the greatest improvement in 
intellectual functioning concomitant with this operative procedure may occur in brighter and older 
patients who, in addition to performance increases, show verbal increases as well. 


ANXIETY AND SOCIO-ECONOMIC STRATIFICATION 
ERWIN J. LOTSOF AND RICHARD CENTERS 
University of California, Los Angeles 


INTRODUCTION 

Our present-day American culture has been termed by some insightful observer
“the age of anxiety” and, indeed, the study of and concern with anxiety as a
psychological variable is no longer confined to the clinician or psychopathologist, but enters
increasingly into the sphere of interest of the social psychologist. Probably
contributing to this expanding concern is the relatively great amount of attention given in the
various mass media of communication to tranquilizing drugs purported to relieve
anxiety and tension. Several months ago, in the Los Angeles area, it was even rather
common to see large signs on drug store windows indicating that these drugs were in
stock, and numerous jokes and quips on radio, television and nightclub shows further
suggested the widespread interest in and use of these anxiety relievers. Such a public
climate of interest and concern with anxiety suggested the feasibility of testing
certain hypotheses relating the anxiety factors to other variables by means of a
survey approach. Several previous investigations have suggested that anxiety
and somatization reactions are related to socio-economic stratification, but in all
cases these studies have been based upon highly selected samples, i.e., clinical
patients, and it was desired to test this relationship with a more adequate sample.

On the basis of observations and reports in the literature, it was hypothesized 
that individuals in the lower socio-economic strata should show more concern over 
somatic factors than individuals in the higher socio-economic groups. Further, it 
was felt that the higher the educational achievement the fewer should be the somatic 
complaints. Also, it was hypothesized that a sense of personal competence should be 
inversely related to concern with somatic factors. 


METHOD


A questionnaire was constructed to assess the degrees of anxiety, tension, con- 
cern with soma, and personal competence. Some of our anxiety items were derived 
from the Minnesota Multiphasic Personality Inventory, and the measure of feelings 
of personal competence was adapted from a study by Douvan and Walker. The
interview schedule also contained questions concerned with socio-economic status, 
educational and other background factors. 

The data for this research are the results of person-to-person interviews with a
sample of 1,250 persons, constituting a cross section of the adult population of the 
Los Angeles Metropolitan area. Each person was interviewed alone at his home. 
Both sexes, above 18 years of age, of various racial and ethnic derivation were in- 
cluded. 

The replies to the interview were coded, employing a dichotomizing procedure,
and phi correlations were obtained among the 44 variables encompassed. A
factor analysis using Kaiser’s Varimax method was performed on the data. This
method is an orthogonal iterative procedure which tends to maximize the variance
of the squared extended vector projections, giving a solution similar to simple
structure.1 In what follows only variables showing loadings of approximately .30 and
above will be given importance. The cutoff value of .30 is arbitrary, since the standard
error is not known.
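
For readers unfamiliar with the coefficient, the dichotomizing and phi steps can be sketched in Python as follows. The plus/minus coding follows the scoring rule the authors report for their frequency items; the function names and the response data are hypothetical.

```python
import math

PLUS_ANSWERS = {"very often", "fairly often"}


def dichotomize(answer):
    """Score a frequency answer plus (1) or minus (0)."""
    return 1 if answer in PLUS_ANSWERS else 0


def phi(x, y):
    """Phi coefficient between two 0/1 variables, from the 2 x 2 table."""
    a = sum(1 for i, j in zip(x, y) if i == 1 and j == 1)
    b = sum(1 for i, j in zip(x, y) if i == 1 and j == 0)
    c = sum(1 for i, j in zip(x, y) if i == 0 and j == 1)
    d = sum(1 for i, j in zip(x, y) if i == 0 and j == 0)
    denominator = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denominator == 0:
        raise ValueError("phi is undefined when a variable has no variation")
    return (a * d - b * c) / denominator


# Hypothetical answers from four respondents to two items.
rapid_pulse = ["very often", "fairly often", "hardly at all", "hardly at all"]
worn_out = ["fairly often", "hardly at all", "very often", "hardly at all"]
x = [dichotomize(answer) for answer in rapid_pulse]
y = [dichotomize(answer) for answer in worn_out]
print(phi(x, x), phi(x, y))
```

Repeating this for every pair of the 44 coded variables would give the correlation matrix that is then factored.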


RESULTS AND DISCUSSION 

A total of fourteen factors were identified, but only seven of these seem suffic- 
iently important to warrant description and comment, the other seven tending to 
have fewer loadings on them with relatively lower intercorrelations of the items en- 
compassed. 

Factor 1, “Anxiety,” loads mainly on questions pertaining to anxiety, tension and
concern with physical well-being or bodily health. For example, some of the items
involved and their loadings are as follows:

“During the past year, how often have you experienced periods of a rapid heart or pulse;
very often, fairly often, or hardly at all?” (Very often and fairly often were scored plus,
hardly at all was scored minus.) Loading .48.


“During the past year, how often have you had times when you felt all worn out or ‘dead
tired’; very often, fairly often, or hardly at all?” (Very often and fairly often were scored plus,
hardly at all was scored minus.) Loading .46.


“During the past year, how often have you been worried about your health; very often,
fairly often, or hardly at all?” (Very often and fairly often were scored plus, hardly at all was
scored minus.) Loading .58.


Our prediction that the anxiety factor should also load on items concerned with 
socio-economic variables was not borne out in our study. 

The second factor, which seems to be mainly socio-economic in nature, loads on
rated socio-economic status (average and above were scored plus, below average
scored minus); house type rating (excellent, very good and good scored plus; average,
fair, poor, and very poor scored minus); dwelling area rating (scored like the
preceding item); occupation (white collar occupations were scored plus, manual work
occupations were scored minus); and class identification (upper and middle class
identifications were scored plus, working and lower class identifications were scored
minus). There was a moderately high loading of this factor on the item: "Have you
heard or read about certain drugs or medicines called tranquilizers?" (A "yes"
answer was scored plus.) Possibly this is due simply to the fact that persons of higher
socio-economic status have more commonly become informed of ataractic drugs than
those of lower status.

1The complete interview schedule, correlation matrix and rotated factor matrix are on file with
the American Documentation Institute, Washington, D. C. Order Document 5866, remitting $1.25
for a or 35 mm. microfilm. Make checks payable to Chief, Photo-duplication Service,
Library of Congress, Washington 25, D. C.

Factor 3, “religiosity”, loads mainly on questions asking whether the respondent 
was a member of a church (‘‘yes’’ was scored plus); which denomination (Christian 
scored plus, Jewish scored minus), and whether he considered himself devout (scored 
plus) or not devout (scored minus). It had been supposed that religiosity would be 
positively related to anxiety. However, it appears not to be so to any appreciable 
degree, nor is it appreciably related to any of the other variables encompassed in our 
study. 

Another factor, number four, "feelings of inadequacy," is similar to the preceding
one in its isolation from other variables. It loads mainly on the following three items:

"During the past year, how often have you felt very blue or depressed; very often, fairly
often, or hardly at all?" (The "often" responses were scored plus.)

"Problems and difficulties are always coming up in peoples' lives. To what extent have you
felt in this past year that problems and difficulties were so great that you might not overcome
them; very often, fairly often, or hardly ever?" (The "often" responses were scored plus.)

"People often have feelings that they are not as good as or are inferior to other people; to
what extent in this past year have you felt this way; very often, fairly often, or hardly at all?"
(The "often" responses were scored plus.)


The fifth factor “Information and Experience” appears to load on having heard 
or read about medicines called tranquilizers, educational level achieved and the age 
of the individual. The factor does not appear to be related to anxiety or other var- 
iables to an appreciable degree. 

Factor 6, "Self-confidence," loads on the following three questions: "Once you
decide to do something, how sure are you of the outcome?" ("Very sure" and "fairly
sure" were scored plus.) "How sure are you usually that your opinions and beliefs are
right?" ("Very sure" and "fairly sure" were scored plus.) And, "To what extent do you
feel that you will be able to get the things you want out of life: very confident, fairly
confident, or not at all confident?" ("Very confident" and "fairly confident" were
scored plus.) It had been our expectation that there would be an inverse relationship
between feelings of self-confidence and indicators of tension and anxiety, but such
did not turn out to be the case.

The seventh factor loads on items relating to feelings of self-competence or
optimism: "Some people feel that their lives are worked out just the way they wanted;
others feel that they have really had bad breaks; how do you feel about the way
your life is turning out, satisfied or dissatisfied?" ("Satisfied" was scored plus.)
"Some people feel that they can make pretty definite plans for their lives for the
next few years; others feel they aren't in a position to plan ahead. How about you:
do you feel able to plan ahead or not?" ("Able to plan ahead" was scored plus.)
"What do you think your chances are of living the kind of life you'd like to have?
Do you think they are pretty good, or not so good?" ("Pretty good" was scored
plus.) We hypothesized initially that these questions, which comprise the index of
personal competence feelings and which concern optimism, would load in the opposite
direction to the anxiety items and in the same direction as self-confidence.
Our hypothesis again was not borne out by the data. This finding may cast some
doubt on the assumption that feelings of self-confidence and optimism are positively
correlated with each other and negatively correlated with tension and anxiety.


SUMMARY

This study concerned itself with anxiety and socio-economic stratification, 
both assessed by a questionnaire administered in person-to-person interviews con- 
ducted with a cross section of the adult population of the Los Angeles metropolitan 
area. Correlations were obtained for the variables and then factor analysed, using 
Kaiser’s Varimax method. Although the anxiety items tend to load on the same 
factor, Factor 1, they do not appear to be related to socio-economic status. A rather 
interesting finding was that optimism and self-competence were not inversely related 
to anxiety. It would appear that, on the basis of the present study, anxiety and 
somatization reactions are unrelated to socio-economic status in the general pop- 
ulation. 


A STUDY OF FACTORS RELATED TO LENGTH OF STAY IN
PSYCHOTHERAPY* 


M. A. BAILEY, L. WARSHAW AND R. M. EICHLER 


Veterans Administration, New York Regional Office 


PROBLEM 

A previous investigation by the authors“? revealed that the patients applying 
for treatment in the Mental Hygiene Clinic of the New York Regional Office are
representative of the veteran population in the greater New York area for most
demographic characteristics. However, assignment to psychotherapy was found to be
related to normative data indicative of high socio-economic status, intrapsychic
complaints, youth, expressed desire for psychotherapy, recommendations from
psychological test evaluation, and previous treatment. Conversely, assignment to the
psychosomatic clinic was found to be related to low socio-economic status, somatic
complaints, requests for medication, lack of recommendation from psychological
test evaluation, and no previous psychotherapy. The present investigation is
designed to determine whether the above and other demographic criteria for
assignment are related to length of stay in psychotherapy or in psychosomatic treatment.


METHOD 

Items of information were recorded on a personal data card in the process of
screening patients to provide data relevant to: age, number of years of schooling,
occupation, religious background, race, diagnosis by psychologists and therapists,
nature of complaint, treatment requested by patient, treatment recommended by
the psychologist following psycho-diagnostic testing, and previous treatment
experience. Approximately one year after the date of testing, the treatment folders of
these patients were reviewed to determine (a) length of stay in treatment, and (b)
amount of improvement. Amount of improvement was determined from the
therapists' evaluations in their progress and summary notes. It was possible to utilize
length of stay in treatment as a criterion of improvement because of the very
significant relationship obtained between the two, considerably beyond .01 (chi square
61.23). Length of stay was divided into the following categories: two months or less,
three to six months, seven to eleven months, and twelve or more months. This
afforded the investigators a quantitative means of evaluating the effectiveness of
treatment. Patients were divided into two treatment groups: one group consisted of
247 psychosomatic patients, and a second group consisted of 211 psychotherapy
patients. Normative data were then studied in relation to length of stay in treatment
for each group. Chi squares and Pearson Product Moment correlations were
computed in order to study the above relationships.

*The authors wish to express their appreciation to Mrs. Louisa Smith for assistance in collecting
the data and to Drs. R. H. Paynter and Leon Rostker for many helpful suggestions.


RESULTS 

Psychosomatic Patients. Chi square tests did not reveal any significant
relationships between length of stay in treatment by medication and any of the
demographic or clinical data studied. However, trends were noted in the relationship
between length of stay in psychosomatic therapy and level of occupation (p = .19),
diagnosis by psychologists (p = .10), diagnosis by the therapists (p = .15), treatment
requested by the patient (p = .08), and treatment recommended by the psychologist
(p = .20). These relationships indicate that patients in lower level occupations,
patients with psychotic diagnoses, patients requesting medication, and patients
recommended for medication subsequent to psychological testing tend to continue
treatment by medication for longer periods of time than those not presenting the
above characteristics.

Psychotherapy Patients. Chi square tests did not reveal any significant
relationships between length of stay in psychotherapy and occupational level, religious
background, diagnoses by psychologists and therapists, nature of complaint, treatment
requested or psychologists' recommendations.

Significant positive relationships were found between length of stay in psycho- 
therapy and number of years of schooling (chi square 32.19, p < .01) and having 
had previous experience in psychotherapy (chi square 11.95, p < .01). There was 
also a small but significant correlation between the length of stay in previous
psychotherapy and the length of stay in subsequent psychotherapy (N = 118, r = .25, p = .01).


DISCUSSION 

In a previous study by the investigators®) it was found that criteria such as 
relative youth, high educational and occupational level, intrapsychic complaints, 
expressed interest in a particular form of treatment, psychologists' test recommendations,
and previous treatment experience played a significant role in determining the
type of treatment to which patients were assigned. The present findings strongly 
suggest that most of the criteria utilized for the selection of psychotherapy patients 
are not valid in predicting their length of stay in psychotherapy with the exception 
of number of years of schooling, and previous treatment experience. Similarly, the 
criteria utilized in selecting patients for treatment in the Psychosomatic Clinic are 
not valid in determining their length of stay in treatment. However, some trends 
are noted in predicting length of stay of psychosomatic patients in treatment which 
suggest that theoretical assumptions such as the use of medication for psychotic 
patients, patients with fewer years of schooling, and patients who are somatically 
oriented have some pragmatic value. 


SUMMARY 
Personal data cards and treatment folders of 247 psychosomatic patients and 
211 psychotherapy patients were reviewed to study the relationship between certain 
normative data and length of stay in treatment. A chi square test revealed a highly 
significant relationship between improvement in psychotherapy and length of stay 
in treatment. Although no statistically significant relationships were obtained
between normative data and length of stay in treatment in the psychosomatic clinic,
several trends were found which established some relationship between length of 
stay in treatment and low socio-economic status, diagnosis of psychosis, and somatic 
orientation. Length of stay in psychotherapy was found to be significantly related 
to number of years of schooling, and amount of previous experience in psychotherapy. 
On the basis of these findings it appears that most criteria for assignment used in this 
clinic are not valid in predicting the length of stay in treatment. 


MULTIDISCIPLINARY VIEWS ON THE PREPARATION OF WRITTEN
CLINICAL PSYCHOLOGICAL REPORTS: III. THE TROUBLE WITH
PSYCHOLOGICAL REPORTS
NORMAN TALLENT WILLIAM J. REISS 


Veterans Administration Center Veterans Administration Center 
The George Washington University 


INTRODUCTION 
Many clinical psychologists and their colleagues on the psychiatric team appear 
to share a feeling that the psychologist’s written clinical evaluation is often not as 
effective a contribution to patient care as it might be. To the end of determining 
what improvements should perhaps be made, an extensive survey of opinion about 
the negative features of current psychological reports was conducted. 


PROCEDURE 


Source of Data. An incomplete sentence (or paragraph) root reading “The 
trouble with psychological reports is . . .," preceded by the instruction "Please
complete in as few or as many words as you see fit:" was presented as an "optional"
exercise on a research form used for another study in the area of clinical psychological
reporting. This form was mailed to 741 psychiatrists, 433 psychiatric social workers,
and 393 psychologists, all Veterans Administration staff persons. Completed forms 
were received from 606 psychiatrists, 384 psychologists, and 421 social workers, but 
of these only 276 psychiatrists, 233 psychologists, and 203 social workers completed 
the sentence (or more usually, the paragraph) root. 

Categorization of Data. The ordering of the raw data into some sort of categorized 
presentation (see Table 1) was a prodigious task, the transcribed responses number- 
ing 118 pages, single spaced. Differences in manner of expression from respondent to 
respondent at times made it difficult to know where several responses represented 
essentially the same idea, i.e., properly belonged in the same category, or contained
some important variations. Nevertheless, it was possible for the investigators jointly 
to consider one response at a time and to translate it into some meaningful expression. 
For example, the statement "The trouble with psychological reports is . . . they are
attempts on the part of the examiner to show what he can do rather than what the
patient is" was jointly placed in the subcategories "exhibitionistic" and "not patient
oriented." Such a procedure of necessity lends an impressionistic flavor to the data
thus processed. To estimate the reliability of this method, therefore, the
investigators, after a lapse of more than two months from the original sorting, independently
sorted 200 randomly selected responses into the categories and subcategories already
established. The level of agreement between the two investigators was 83%.


TABLE 1. CRITICISMS OF PSYCHOLOGICAL REPORTS*

                                          Psychiatrists   Psychologists   Social Workers

PROBLEMS OF CONTENT
   A. Unnecessary duplication
   B. Minor relevancy
   C. Offer diagnosis
   D. Offer prognosis
   E. Offer recommendations
   F. Improper emphasis
   G. Too much raw data
   H. Too little raw data
   I. Delete essential material

PROBLEMS OF INTERPRETATION
   A. Too general (reports do not differentiate among patients)
   B. Too speculative
   C. Speculation not labeled as such
   D. Assorted irresponsible interpretations

PROBLEMS OF PSYCHOLOGIST'S ATTITUDE OR ORIENTATION
   A. Overabstract
   B. Too authoritative
   C. Exhibitionistic
   D. Not patient oriented
   E. Test oriented
   F. Too theoretical
   G. Not written with practical or useful purpose in mind
   H. Miscellaneous (Especially: too intellectual, lack of interest in communication)

PROBLEMS OF COMMUNICATION
   A. Defective communication (reasons not given)
   B. Expressive deficiencies (Especially: deficiencies of terminology and wordiness) 37
      Style deficiency
      Vague, unclear or ambiguous
      Too technical
      Too complex
      Hedge
      Too long
      Too brief
      Poorly organized

PROBLEMS OF ROLE CONDUCT (Encroachment)

PROBLEMS OF SCIENCE AND PROFESSION (Especially: psychological theory,
   validity of tests, unsettled role of psychologist)

*Figures represent per cent of respondents in each category who made a negative comment or
comments about psychological reports.


RESULTS AND DISCUSSION 
The tabled data must be examined with two additional limitations in mind. 
Only about half of the respondents who returned research forms took the opportun- 
ity to make comments about psychological reports. There is no way of knowing 
whether those who failed to respond to this item had no negative feelings about
reports or did not complete the item for some other reason. In any event, the
percentages shown in the table stem only from those who had any negative comment or
comments to make, not from the entire professions as they are labeled in the table.
Quite a few responses were positive in nature, some even laudatory of psychological
reports. But this study deliberately attempted to seek out the negative features of
clinical reports with a view to publicizing them so that they might be reduced in future
writing.

The principal issue which psychiatrists take with the psychological report is 
that it contains too much in the way of various forms of irresponsible interpretation. 
The data indicate many ways in which psychological material may be interpreted 
irresponsibly, but the tabled sub-category ‘‘assorted irresponsible interpretations”’ 
refers principally to such matters as overinterpretation, underinterpretation, in- 
sufficient objectivity, reflecting the examiner’s personality, being textbookish, being 
not adequately related to theory, and just plain invalid interpretation. Psychia- 
trists are also greatly concerned with what is here called ‘‘expressive deficiencies’’, a 
term used principally to cover the use of objectionable terminology (primarily jargon) 
and wordiness. Many of them would be happier if more raw data and less speculation 
were included in the report. Many psychiatrists complain that the psychological 
report is unclear (or ambiguous) and too long. A sufficiently large number of
psychiatrists to be reckoned with also seem to be concerned that the psychological report
reflects encroachment upon other disciplines, primarily their own. The extent to
which this is so might be greater than the figure in the table suggests since several
items included under "Problems of Content," viz., "offer diagnosis" and "offer
recommendations," might well reflect concern with encroachment.

Perhaps the psychologist’s greatest criticism of psychological reports centers 
about the expressive deficiencies, but he is about equally concerned that the psycho- 
logical report might not be written with a practical or useful purpose in mind. The 
psychologists are critical that reports tend to be too general, that is, that they do 
not differentiate among patients, and they also tend to feel that various forms of 
irresponsible interpretation enter the report. Improper emphasis (overemphasis), 
particularly on psychoanalysis, pathology, diagnosis, dynamics, and improper 
emphasis (underemphasis), particularly on strengths and treatment possibilities, is 
yet another criticism. Like the psychiatrists, the psychologists are critical of unclear 
writing and excessive length. Unlike the psychiatrists, however, who complain of 
too little raw data in the report, the psychologists see the report as containing too 
much raw data. A number of psychologists seem to be cognizant of problems which 
center about the science and profession of psychology as opposed to problems of the 
individual clinician. In particular, such matters as deficiencies in supporting psycho- 
logical theory, the validity of tests, and the unsettled role of the psychologist are 
mentioned. 

The social workers join with both the psychiatrists and psychologists in being 
critical of expressive deficiencies. Two-thirds of the social workers who responded to 
the item "The trouble with psychological reports is . . ." made this complaint. Like
their colleagues on the psychiatric team the social workers are also critical of irres- 
ponsible interpretation in the report and unclearness of writing. A number of social 
workers, like the psychologists and some psychiatrists too, feel that the report is not 
written with a practical purpose in mind. 


SUMMARY 


A large number of psychiatric team members, psychiatrists, psychologists, and 
psychiatric social workers were asked to complete the sentence (or paragraph) root 
"The trouble with psychological reports is . . ." The comments were categorized
into Problems of Content, Problems of Interpretation, Problems of Psychologist’s 
Attitude or Orientation, Problems of Communication, Problems of Role Conduct, 
and Problems of Science and Profession. Some apparently meaningful patterns of 
criticism were found. 





SOME CAUTIONS CONCERNING THE USE OF CHANGE SCORES 
FRED HEILIZER 


Veterans Administration Hospital, Northampton, Mass. 


The use of initial scores as a control variable or selection device is a proper and 
useful statistical technique. The use of change scores to indicate improvement or 
deterioration of performance is also a proper and useful statistical technique. The 
simultaneous use of these techniques, however, is neither proper nor useful. 

Churchill“ presents a derivation for the correlation between initial scores and 
(a) difference scores, and (b) ratio change scores. The correlation between initial
scores and difference scores is given by

    r_{xd} = \frac{r_{xy} - c}{\sqrt{1 + c^{2} - 2 c \, r_{xy}}}    [1a]

where X = initial score, D = Y - X, and c = σ_x/σ_y. It can be seen that r_xd is a
function of r_xy and c. The correlation between initial scores and ratio scores is

    [1b]

where X = initial score, θ = (Y - X)/X, V = σ_x/M_x, W = σ_y/M_y, and A = -1
if the quantity (V - r_xy W) is positive and +1 if the quantity is negative. It can be
seen that r_xθ is a function of r_xy, V, and W.

It is our purpose to provide a less complex derivation for the above formulae and 
to extend the derivation of [1a] to additional experimental situations by utilizing
Standard Scores. (The additional derivations are not feasible for ratio change scores 
when Standard Scores are utilized, since ratios of infinity occur. This, in itself, is a 
pertinent comment on the use of ratio change scores.) In so doing, we make the 
assumption that a correlation is utilized solely to test an hypothesis of co-relation; 
it is not intentionally utilized to test an hypothesis of group differences on such 
parameters as means or variances. Therefore, the uses and limitations of a particular 
correlation are most properly indicated by utilizing variables with equated means 
and variances, e.g., Standard Scores. In the presentation to follow, r refers to the 
general correlation formula while r' refers to the correlation formula which is derived
by the use of Standard Scores. 

If we are interested in the relationship between initial scores and difference
scores, where X = initial score and D = Y - X,

    r_{xd} = \frac{\sum XD/N - (M_x)(M_d)}{\sigma_x \sigma_d}    [2a]

When X and Y are Standard Scores,

    r'_{xd} = \frac{r_{xy} - 1}{\sigma_d} = \frac{r_{xy} - 1}{\sqrt{2(1 - r_{xy})}}    [2b]

Formula [2b] indicates that when a correlational hypothesis is tested between
initial scores and difference scores the correlation cannot be positive; it must be
either zero or negative.
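
The claim that standardized initial scores cannot correlate positively with difference scores can be checked numerically. The sketch below is an illustration under stated assumptions: the data are invented, the helper names are hypothetical, and the closed form it compares against, r' = (r_xy - 1)/sqrt(2(1 - r_xy)), is the expression that follows algebraically from the definitions D = Y - X with X and Y in standard-score form.

```python
import math


def mean(values):
    return sum(values) / len(values)


def pearson(u, v):
    """Pearson product-moment correlation of two equally long sequences."""
    mu, mv = mean(u), mean(v)
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv)


def standardize(values):
    """Convert raw scores to standard scores (mean 0, population SD 1)."""
    m = mean(values)
    sd = math.sqrt(sum((a - m) ** 2 for a in values) / len(values))
    return [(a - m) / sd for a in values]


def initial_vs_difference(x, y):
    """Return (r_xy, observed r'_xd, r'_xd predicted from r_xy alone)."""
    zx, zy = standardize(x), standardize(y)
    d = [b - a for a, b in zip(zx, zy)]  # difference of standard scores
    r_xy = pearson(zx, zy)
    observed = pearson(zx, d)
    predicted = (r_xy - 1) / math.sqrt(2 * (1 - r_xy))
    return r_xy, observed, predicted


if __name__ == "__main__":
    # Hypothetical initial (x) and final (y) scores for six subjects.
    x = [3, 5, 8, 10, 12, 15]
    y = [4, 9, 7, 14, 10, 18]
    print(initial_vs_difference(x, y))
```

The observed and predicted values agree whatever data are supplied, which is the point of the caution: the sign of the correlation is fixed by the arithmetic, not by the subjects.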


If we are interested in the relationship between initial scores and ratio change
scores, where X = initial score and θ = (Y - X)/X,

    r_{x\theta} = \frac{\sum X\theta/N - (M_x)(M_\theta)}{\sigma_x \sigma_\theta}    [3a]

When X and Y are Standard Scores,

    r'_{x\theta} = \frac{\sum (Y - X)/N}{\sigma_\theta} = 0    [3b]

Formula [3b] indicates that when a correlational hypothesis is tested between
initial scores and ratio change scores the correlation is zero.

If we are interested in the relationship between two sets of difference scores
which contain a variable in common, where D = Y - X and E = Z - Y,

    r_{de} = \frac{\sum DE/N - (M_d)(M_e)}{\sigma_d \sigma_e}    [4a]

When X, Y, and Z are Standard Scores,

    \sum DE/N = \sum (XY + YZ - XZ - Y^{2})/N = r_{xy} + r_{yz} - r_{xz} - 1

    r'_{de} = \frac{r_{xy} + r_{yz} - r_{xz} - 1}{\sigma_d \sigma_e}    [4b]

Formula [4b] indicates that r'_de cannot be positive; it can only be zero or
negative. Furthermore, if r_xy = r_yz = r_xz = r, since

    \sigma_d = \sqrt{\sigma_x^{2} + \sigma_y^{2} - 2 r_{xy} \sigma_x \sigma_y} = \sqrt{2(1 - r_{xy})}

and

    \sigma_e = \sqrt{\sigma_y^{2} + \sigma_z^{2} - 2 r_{yz} \sigma_y \sigma_z} = \sqrt{2(1 - r_{yz})}

then [4b] becomes

    r'_{de} = \frac{r - 1}{2(1 - r)}    [4c]

Formula [4c] indicates that r'_de = -.50 for all values of r, except when r = +1,
in which case r'_de = 0.
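
The constant value of -.50 for equal intercorrelations can be illustrated with constructed data. The sketch below is an assumption-laden example, not part of the original derivation: it builds three standard-score variables from four mutually orthogonal sign patterns so that every intercorrelation equals a chosen r, then correlates D = Y - X with E = Z - Y. The pattern vectors and function names are hypothetical.

```python
import math

# Four mutually orthogonal, mean-zero sign patterns (rows of an 8 x 8 Hadamard matrix).
G = [1, 1, 1, 1, -1, -1, -1, -1]    # component shared by X, Y, and Z
E1 = [1, 1, -1, -1, 1, 1, -1, -1]   # component unique to X
E2 = [1, -1, 1, -1, 1, -1, 1, -1]   # component unique to Y
E3 = [1, -1, -1, 1, 1, -1, -1, 1]   # component unique to Z


def pearson(u, v):
    """Pearson product-moment correlation of two equally long sequences."""
    mu, mv = sum(u) / len(u), sum(v) / len(v)
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = math.sqrt(sum((a - mu) ** 2 for a in u))
    sv = math.sqrt(sum((b - mv) ** 2 for b in v))
    return cov / (su * sv)


def equicorrelated_scores(r):
    """Standard scores X, Y, Z whose three intercorrelations all equal r (0 <= r < 1)."""
    a, b = math.sqrt(r), math.sqrt(1 - r)
    x = [a * g + b * e for g, e in zip(G, E1)]
    y = [a * g + b * e for g, e in zip(G, E2)]
    z = [a * g + b * e for g, e in zip(G, E3)]
    return x, y, z


def correlation_of_differences(r):
    """Correlation between D = Y - X and E = Z - Y for equicorrelated standard scores."""
    x, y, z = equicorrelated_scores(r)
    d = [yi - xi for xi, yi in zip(x, y)]
    e = [zi - yi for yi, zi in zip(y, z)]
    return pearson(d, e)


if __name__ == "__main__":
    for r in (0.0, 0.3, 0.9):
        print(r, correlation_of_differences(r))
```

The case r = +1 is left out of the sketch because both difference scores are then constant and the product-moment correlation cannot be computed from data.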
A more complex situation results if we are interested in the relationship between
difference scores and a control variable other than the initial score variable, where
C = control score, and D = Y - X,

    r_{cd} = \frac{\sum CD/N - (M_c)(M_d)}{\sigma_c \sigma_d}    [5a]

When C, X, and Y are Standard Scores,

    r'_{cd} = \frac{\sum CD/N}{\sigma_d} = \frac{\sum [C(Y - X)]/N}{\sigma_d} = \frac{\sum CY/N - \sum CX/N}{\sigma_d} = \frac{r_{cy} - r_{cx}}{\sigma_d}    [5b]


Formula [5b] indicates that, as r_cx approaches +1, it becomes increasingly more
likely that r'_cd will be negative; as r_cx approaches -1, it becomes increasingly more
likely that r'_cd will be positive. It can also be seen that [2b] differs from [5b] only in
the substitution of r_cx in the numerator of [5b] for 1 in the numerator of [2b].
Therefore, the more highly correlated are the initial and control scores, the more
alike are the two formulae. Further, one might expect to obtain strikingly different
values of r'_cd, on an a priori level, for widely differing values of r_cx.

The above derivations indicate that spurious correlations may result when 
change scores are utilized. When the change scores are correlated with a redundant 
variable, as in [2a], [3a], and [4a], the spuriousness seems obvious. However, when a 
redundancy may not be apparent, as in [5a], then careful attention must be paid to 
the hypothesis being tested and to the implications involved. 


THE EFFECT OF TIED RANKS ON THE RANK ORDER CORRELATION
FRED HEILIZER 


Veterans Administration Hospital, Northampton, Mass. 


The purpose of this paper is to indicate a potential bias in the rank order 
correlation when ties are involved. The notation used here is that used by Siegel.

The following formulae are appropriate for computing the rank order correlation
when there are no tied ranks (r_s refers to the Spearman rank order correlation and,
where X and Y are ranked variables, x = M_x - X, y = M_y - Y, d = X - Y):

    r_s = \frac{\sum x^{2} + \sum y^{2} - \sum d^{2}}{2\sqrt{\sum x^{2} \sum y^{2}}} = \frac{2\sum x^{2} - \sum d^{2}}{2\sum x^{2}} = 1 - \frac{\sum d^{2}}{2\sum x^{2}}    [1]

Let us assume, for convenience, that the correction for tied ranks, T = (t³ - t)/12,
where t = the number of observations tied at a given rank, is the same in X and
Y. Then, where Σx_c² = Σx² - ΣT, r_s = 1 - (Σd²/2Σx_c²). Since (Σd²/2Σx_c²) >
(Σd²/2Σx²), the correction results in a change of all correlations toward the negative
side of the r_s distribution, thus indicating that the effect of tied ranks, when no
correction is used, is to bias r_s toward the positive side. The following recommendations
are appropriate when tied ranks occur: (a) the correction should be applied when a
positive correlation is significant but need not be applied when it is not significant;
(b) the correction should be applied when a negative correlation is not significant but
need not be applied when it is significant; (c) where the significance level is
determined from a table of Σd², the occurrence of tied ranks invalidates the use of the
table within the limits set forth in (a) and (b).
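
The direction of the bias can be seen in a small worked example. The sketch below is illustrative only: the scores are invented, the function names are hypothetical, tied observations are given the average of the ranks they occupy, and the corrected coefficient uses the general form (Σx² + Σy² - Σd²)/(2·sqrt(Σx²·Σy²)) with Σx² = (N³ - N)/12 - ΣT.

```python
import math
from collections import Counter


def midranks(values):
    """Rank from 1 (lowest); tied values share the mean of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def tie_term(values):
    """Sum of T = (t^3 - t)/12 over every group of tied observations."""
    return sum((t ** 3 - t) / 12 for t in Counter(values).values())


def sum_d_squared(x, y):
    return sum((a - b) ** 2 for a, b in zip(midranks(x), midranks(y)))


def spearman_uncorrected(x, y):
    """r_s = 1 - 6*sum(d^2)/(N^3 - N), ignoring the presence of ties."""
    n = len(x)
    return 1 - 6 * sum_d_squared(x, y) / (n ** 3 - n)


def spearman_tie_corrected(x, y):
    """r_s with the sums of squares reduced by the tie correction."""
    n = len(x)
    sx2 = (n ** 3 - n) / 12 - tie_term(x)
    sy2 = (n ** 3 - n) / 12 - tie_term(y)
    return (sx2 + sy2 - sum_d_squared(x, y)) / (2 * math.sqrt(sx2 * sy2))


if __name__ == "__main__":
    # Hypothetical scores; each variable contains one pair of tied observations.
    x = [1, 2, 2, 3, 4, 5]
    y = [2, 2, 1, 3, 5, 4]
    print(spearman_uncorrected(x, y), spearman_tie_corrected(x, y))
```

For these invented scores the corrected coefficient is slightly lower than the uncorrected one, in line with the argument that ignoring ties biases r_s toward the positive side.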


HYPOCHONDRIASIS AND SOMATIC SEIZURE AURAS
BERNARD S. AARONSON
New Jersey State Hospital at Trenton 


Application of the principle of stimulus generalization to personality data sug- 
gests that one source for the generation of traits may be the evocation of a response 
by a class of stimuli after the response has been elicited by a member of that class. 
Such a response should become more probable when any stimulus belonging to the 
same group appears at any level of intensity. In the case of epileptics, it would be 
anticipated that those whose seizures are preceded by gastric auras, paresthesias, or 
auras involving pain (somatic auras), should show more hypochondriacal concern 
than those with visual, auditory, psychic, kinaesthetic, or motor auras (nonsomatic 
auras), or no auras at all. Individuals with somatic auras should obtain profiles on 
the Minnesota Multiphasic Personality Inventory in which the Hs scale has a higher 
rank than in those obtained by individuals with nonsomatic or no auras at all. 


METHOD
All MMPI profiles of epileptics were extracted from the files of the psychology
department at New Castle State Hospital and all profiles with more than 20 items
left unanswered or with any of the validating scores equalling or exceeding a T score
of 70 were eliminated. Individuals whose seizures occurred during sleep were also
eliminated. The remaining sample was divided into somatic, nonsomatic, and no 
aura groups which were matched for age and sex since it has been shown“: 2?) that 
Hs is affected by both of these variables. This left three groups of six males and four 
females with mean ages of 28.9, 29.1 and 30.0 years in the somatic, nonsomatic and
no aura groups respectively. These values are not significantly different. The scales 
were ranked in descending order and the rank of the Hs scale determined in each 
profile. All differences were tested by means of an analysis of variance of ranks. 
MMPI ranks rather than scale scores were used as the measure because the 
additional symptom of the aura might affect the absolute scale score. Ranks repre- 
sent the relative strengths of the several response tendencies represented by the 
MMPI scales. An alteration in rank reflects not only an alteration in the frequency 
of responses emitted which are classifiable under a given scale name, but also an 
alteration relative to the frequency of responses classifiable under other scale names. 
Unless the absolute scale scores are of practically equal height, the ranks are less 
likely to be materially affected by the addition of one or two points to a scale score. 


RESULTS AND DISCUSSION

The mean rank on Hs of the somatic group was 4.4, of the nonsomatic group,
6.5, and of the no aura group, 7.0. It should be noted that only a ranking from one 
to ten is possible in a conventional MMPI profile. An analysis of variance of ranks 
over the whole data yields a chi square of 6.45, significant beyond the .05 level with 
two degrees of freedom. Comparison of the somatic and no aura group yields a chi 
square of 8.10, which is significant beyond the .01 level with one degree of freedom. 
A trend seems present for Hs to have a higher rank in the somatic than in the
nonsomatic group, but it does not attain significance at the .05 level. No difference seems
present between the nonsomatic and no aura groups. 
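
The significance levels quoted above can be checked directly, because the upper-tail probability of chi square has a closed form for one and two degrees of freedom. A minimal sketch follows; the function names are illustrative.

```python
import math


def chi2_upper_tail_df1(x):
    """P(chi square with 1 df exceeds x) = erfc(sqrt(x / 2))."""
    return math.erfc(math.sqrt(x / 2.0))


def chi2_upper_tail_df2(x):
    """P(chi square with 2 df exceeds x) = exp(-x / 2)."""
    return math.exp(-x / 2.0)


p_overall = chi2_upper_tail_df2(6.45)  # all three groups
p_pair = chi2_upper_tail_df1(8.10)     # somatic vs. no aura
print(f"overall p = {p_overall:.4f}; somatic vs. no aura p = {p_pair:.4f}")
```

Both probabilities fall below the stated criteria: roughly .04 for the overall test and roughly .004 for the comparison of the somatic and no aura groups.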

Epileptics with somatic auras seem to have more hypochondriacal concern than 
those with no auras. A trend seems present for them to show more somatic concern 
than those with nonsomatic auras, while the nonsomatic and no aura groups seem to 
differ only in a random manner. All of the patients in this study suffer from a chronic 
illness for which they have had to be hospitalized and all suffer from major seizures 
which come on only while they are awake. The only variable with respect to which 
they differ is the occurrence of the aura, which is a relatively mild experience com- 
pared to that which it foreshadows. The fact that this minor stimulus is associated 
with significant variation in the rank of the Hs scale suggests the role of stimulus 
generalization in the development of personality traits. A persistent symptom may 
elicit responses which then become more probable for the entire class of stimuli of 
which this stimulus is an instance. 


SUMMARY 
Epileptics with somatic seizure auras obtain significantly higher ranks on the 
MMPI Hs scale than do those with no auras. A trend is also noted for those with 
somatic seizure auras to have higher ranks than those with nonsomatic auras. The 
nonsomatic and no aura groups do not seem to differ significantly. The role of stim- 
ulus generalization in the generation of personality traits is suggested. 

THE SHIPLEY-HARTFORD SCALE AND THE DOPPELT SHORT FORM
AS ESTIMATORS OF WAIS IQ IN A STATE HOSPITAL POPULATION 
LLOYD K. SINES AND HELEN SIMMONS 
Fergus Falls State Hospital, Minnesota 


PROBLEM 

The senior author (4) has reported the use of the Shipley-Hartford scale in
estimating Wechsler-Bellevue scores in two VA populations. Further, several
investigators (Himelstein (2), Olin and Reznikoff (3)) have evaluated the predictive
accuracy of Doppelt’s (1) short form WAIS in psychiatric settings. Because the
conversion of Shipley scores to Wechsler equivalents among incoming patients at
this hospital yielded consistent over-estimates when the tables developed in the
VA settings were used, and in order to compare the efficiency of the Shipley scale
with that of a validated “short cut” to WAIS estimation, the two methods were
evaluated on a randomly selected sample of patients admitted to the Fergus Falls
State Hospital.


PROCEDURES AND SUBJECTS 

All patients between the ages of 15 and 65 admitted to the hospital were screened 
for reading ability¹ and general (physical, psychiatric) amenability to psychological
testing. 

Approximately 45% of all patients admitted during a consecutive 15 month 
period met the criteria established, and were given a routine battery of psychological 
tests, one of which was the Shipley-Hartford scale, within the first week of hospital- 
ization. From that group of patients a random sample of 30 was selected serially, 
and each patient was given a full (eleven subtest) WAIS within one week of the time 
at which the group tests (including the Shipley) had been completed. 


The patients selected ranged in age from 16 to 62 (mean 40.3, sigma 13.5), and 
included 19 psychotics (12 schizophrenics), 4 neurotics, and 7 of miscellaneous 
diagnoses. 


RESULTS 

WAIS full scale IQ ranged from 67 to 130 (mean 96.1, sigma 14.5), while Shipley 
total scores ranged from 13 to 70 (mean 37.0, sigma 17.6). Intercorrelations between 
the WAIS and Shipley subscales indicated that Shipley total score (uncorrected for 
omitted vocabulary items) was the best predictor of WAIS full scale IQ. The product
moment correlation coefficient obtained between the latter two variables was .90, 
and the standard error of estimate in predicting WAIS IQ from Shipley total score 
was 6.33. Employing Wechsler’s scheme of intellectual classification, it was found 
that the estimate of WAIS IQ made from Shipley total score correctly classified 
eighteen (60%) of the thirty subjects. 

Doppelt short form IQ estimates were obtained, and the correlation between 
such estimates and actual WAIS full scale IQ was found to be .960. The standard 
error of estimate of IQ from the Doppelt formula was 4.06, while the accuracy of 
classification of the latter method was 67%. These figures are in substantial agree- 
ment with those reported by others in previous evaluations of the Doppelt formula. 
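
Both standard errors of estimate are consistent with the usual relation SEE = SD × sqrt(1 − r²), taking SD as the WAIS standard deviation of 14.5 reported above. That the authors computed them this way is an assumption; the sketch below only checks the arithmetic, and the function name is illustrative.

```python
import math


def standard_error_of_estimate(sd_criterion, r):
    """Standard error of estimate for a one-predictor linear regression."""
    return sd_criterion * math.sqrt(1.0 - r ** 2)


WAIS_SD = 14.5  # sigma of WAIS full scale IQ in this sample

see_shipley = standard_error_of_estimate(WAIS_SD, 0.90)
see_doppelt = standard_error_of_estimate(WAIS_SD, 0.96)
print(round(see_shipley, 2), round(see_doppelt, 2))
```

The relation gives about 6.32 for the Shipley and 4.06 for the Doppelt, against the reported 6.33 and 4.06. A gap of this size for the Shipley is what one would expect from r having been rounded to two places.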

Though estimation of WAIS full scale IQ by the Doppelt short form method
tends to yield somewhat better predictions than does estimation from the
Shipley-Hartford, the fact that scores on the latter scale are highly related to
WAIS scores and, in addition, the fact that the scale requires no professional
time for administration and scoring justify its use as a brief measure of
intellectual functioning. Therefore, a conversion table for relating Shipley
total scores (the best predictor) to WAIS full scale IQ was constructed by
solving the derived regression equation². Table 1 provides the obtained values.

¹The Stanford-Binet item Reading and Report (Form L, X, 2) was routinely administered. If the
subject met the reading criterion for that item (not more than two errors), he was judged suitable for
psychological testing requiring reading.


TABLE 1. WAIS FULL SCALE IQ EQUIVALENTS FOR SHIPLEY-HARTFORD TOTAL
SCORES DERIVED FROM THE STATE HOSPITAL SAMPLE
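
Equivalents of this kind can be regenerated from the regression equation given in the footnote, WAIS IQ = (.7463)(Shipley total score) + 68.455. The sketch below is illustrative only: the function names and the rounding to whole IQ points are assumptions, and the rows printed are evenly spaced scores within the observed sample range rather than the authors' tabled rows.

```python
SLOPE = 0.7463      # regression weight for Shipley total score (from the footnote)
INTERCEPT = 68.455  # regression constant (from the footnote)


def wais_iq_from_shipley(total_score):
    """Estimated WAIS full scale IQ for a Shipley-Hartford total score."""
    return SLOPE * total_score + INTERCEPT


def conversion_rows(low, high, step):
    """(Shipley total, rounded WAIS IQ estimate) pairs from low to high."""
    return [(score, round(wais_iq_from_shipley(score)))
            for score in range(low, high + 1, step)]


for shipley, iq in conversion_rows(15, 70, 5):
    print(f"{shipley:>3}  {iq:>3}")
```

At the sample mean Shipley score of 37.0 the equation returns about 96.1, which agrees with the reported mean WAIS full scale IQ.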
SUMMARY


Patients between the ages of 15 and 65 who were admitted to the Fergus Falls 
State Hospital during a 15 month period were screened for reading ability and 
amenability to psychometric assessment. Thirty such patients who had taken a 
routine battery of psychological tests (including the Shipley-Hartford scale) were 
randomly selected, and were given a complete WAIS within one week following 
screening. 

WAIS and Shipley scales were intercorrelated, and it was found that Shipley
total score was the best predictor of WAIS full scale IQ (r = .90). Doppelt short
form WAIS IQ estimates were also obtained and were found to correlate .96 with
actual WAIS full scale IQ. Though it was concluded that the Doppelt formula
provides more accurate estimates of WAIS IQ, in view of the very high degree of
relationship between Shipley and WAIS scores, and because of the ease and economy
of administering and scoring the Shipley, a conversion table was provided whereby
WAIS full scale IQ equivalents may be obtained for Shipley total scores.


REFERENCES
1. Doppelt, J. E. Estimating the full scale score on the Wechsler Adult Intelligence Scale from
scores on four subtests. J. consult. Psychol., 1956, 20, 63-66.
2. Himelstein, P. Evaluation of an abbreviated WAIS in a psychiatric population. J. clin.
Psychol., 1957, 13, 68-69.
3. Olin, T. D., and Reznikoff, M. The use of Doppelt’s short form of the Wechsler Adult Intelligence
Scale with psychiatric patients. J. consult. Psychol., 1957, 21, 27-28.
4. Sines, L. K. Intelligence test correlates of Shipley-Hartford performance. J. clin. Psychol.,
1958, 14, 399-404.


²WAIS IQ = (.7463) (S-H total score) + 68.455.










EDITORIAL OPINION 





ACADEMIC PSYCHOLOGY: A SEMANTIC MORASS? 


This editorial was stimulated by a chance remark of a prominent university 
psychologist who stated that he had counseled his son against taking psychology
courses at the university while an undergraduate because he did not wish the son’s 
high intuitive interest and understanding of behavior to be stultified by the particu- 
lar academic orientation of that department. While it may be invalid to refer to the 
“academic orientation” of a department composed of many members presumably
holding different viewpoints, there is nevertheless some truth in the observation that 
many departments have acquired reputations for the types of psychology taught 
there. One of the difficulties lies in the fact that all of the different systems of 
psychology have developed distinctive vocabularies, the comprehension of which 
constitutes a formidable exercise in semantics. 

The student’s plight is not relieved by the attempts of textbook writers to 
acquaint the beginner with all psychological systems in one bulging introductory 
text. Recently we have had occasion to review the content of the introductory 
psychology course at a leading college. In the first semester, the beginner was
required to master the vocabularies of experimentalism, Behaviorism, Gestalt
psychology, psychoanalysis, and elementary statistics, and of a large number of illustrative
experiments. By actual comparison, the 1959 introductory course was requiring a 
student to master materials given in the first graduate year 20 years ago and some of 
the examination questions were exact duplicates of Ph.D. preliminary examination 
questions of 20 years ago! Very few undergraduates are capable of comprehending,
much less digesting, such an exercise in semantics, except for those with very high
memory ability who can parrot back the words. 

At the rate at which complications of vocabulary are being introduced into 
academic psychology, future generations of psychologists will be required to devote 
their entire life spans to mastering the semantic intricacies of all the theoretical 
constructs in the field. Indeed the student is confronted with a semantic morass 
when he attempts to digest such systems as Hullian learning theory, Allport’s event
structure theory, Kelly’s personal construct theory, Cattell’s factorial theory, or 
modern decision theory, to cite a few of the worst offenders. We have here the 
dangerous situation where a student must devote more time to mastering the sem- 
antic complexities of a scientific vocabulary than he has available for directly exper- 
iencing the raw data of the science. 

Human behaviors occur in almost infinite variety and it is not feasible to coin
a new vocabulary to describe each new personality concept or clustering of
etiologic factors that may be discovered. To do so would intolerably deepen the semantic
morass we are already entangled in. What is needed is a rigorous application of basic 
English to scientific communications of all types with the adoption of a basic vocabu- 
lary to which new terms would not be added except when no simpler suitable term 
was available. Journal editors could facilitate such a solution by rigorously cutting 
away semantic confabulations and excluding needless complexities. Indeed, this 
editor must confess that after 30 years in the field he is constantly encountering 
manuscripts which are comprehensible only by taking out an inordinate amount of 
time to laboriously study out the intricacies of some new semantic jungle which an 
author has staked out for himself. It might be very revealing to request a representa- 
tive group of older academic psychologists to take comprehensive examinations cover- 
ing all of the complexities of theory which have recently appeared. 

The significance of all this for clinical psychology is that promising students in 
the clinical field should not be excluded from advanced study simply because they 
cannot master the semantic jungle which is academic psychology. There are many 
students who have a genuine intuitive understanding of behavior, and who are by
temperament suited for clinical work, but who may not have the semantic abilities to 
comprehend modern academic psychology. We would not advocate for a minute 
that academic psychologists be deprived of the right to pursue their semantic
complexities. But we would insist that promising clinical students should not be deprived of a
career simply because they cannot master the semantic intricacies of academic 
psychology. It is entirely possible that schools of clinical psychology should develop 
their own curricula consisting of materials of proven clinical significance. Such 
curricula would expose the student to the complexities of academic theory but would
not make promotion dependent upon ability to master them. 

The situation would not be so serious if any eternal verities of genuine
significance were being communicated in terms of such semantic obscurities, since it
could then be claimed that the student was being forced to “dig it out for himself.”
Too often, such is not the case. This editor has long since developed the habit of
evaluating every sentence of a manuscript to discover whether anything of significance
is being communicated and, if so, whether it is being expressed in the simplest
operational terms and with a minimum of excess verbiage. If most of the current
literature were evaluated in such terms, it would have to be concluded that much
semantic ado is being made over a lot of facts of very limited significance.

It should be the obligation of our senior colleagues to “boil down” the plethora
of current semantic complications into the simplest operational terms. Almost 30 
years ago, the writer struggled through the basic literature of Gestalt psychology, 
encountering a very difficult presentation semantically, but eventually coming to 
comprehend the basic contributions by translating them into simple operational 
terms. Indeed, the essence of Gestalt psychology is not the semantic complexities in 
which the original presentations were couched, but rather a series of ingenious exper- 
iments and observations which can be very simply stated. Similarly, the essence of 
more recent contributions is not in the semantic complexities with which authors seem
to feel compelled to complicate the issues, but rather in a set of relatively simple 
operational viewpoints which can be set forth in simple operational terms. 

The essence of psychology is behavior. The theories, concepts, operational 
methods and vocabularies which have developed in such complexities are strictly 
superfluous unless they deal with and communicate the issues both validly and 
economically. There are many factors operating on the current psychological scene 
tending to perpetuate and even to proliferate contemporary theoretical and semantic 
excesses. If our tremendously complicated field is to survive expeditiously, it must 
be streamlined by a rigorous paring down, at least on teaching and applied levels, of 
all excess baggage. Promising clinical students must not be failed because they can- 
not escape from semantic morasses. 


y. GF. 